SEQUENCE LISTING



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Reed, Steven G.
Skeiky, Yasir A.W.
Dillon, Davin C.
Campos-Neto, Antonio
Houghton, Raymond
Vedvick, Thomas S.
Twardzik, Daniel R.
Lodes, Michael J.
Hendrickson, Ronald

(ii) TITLE OF INVENTION: COMPOUNDS AND METHODS FOR IMMUNOTHERAPY
AND DIAGNOSIS OF TUBERCULOSIS

(iii) NUMBER OF SEQUENCES: 355 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Columbia Center, 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 05-MAY-1998 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Maki, David J.

(B) REGISTRATION NUMBER: 31,392 

(C) REFERENCE/DOCKET NUMBER: 210121.411C9

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 766 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CGAGGCACCG GTAGTTTGAA CCAAACGCAC AATCGACGGG CAAACGAACG GAAGAACACA 60 

ACCATGAAGA TGGTGAAATC GATCGCCGCA GGTCTGACCG CCGCGGCTGC AATCGGCGCC 120

GCTGCGGCCG GTGTGACTTC GATCATGGCT GGCGGCCCGG TCGTATACCA GATGCAGCCG 180 

GTCGTCTTCG GCGCGCCACT GCCGTTGGAC CCGGCATCCG CCCCTGACGT CCCGACCGCC 240 

GCCCAGTTGA CCAGCCTGCT CAACAGCCTC GCCGATCCCA ACGTGTCGTT TGCGAACAAG 300 

GGCAGTCTGG TCGAGGGCGG CATCGGGGGC ACCGAGGCGC GCATCGCCGA CCACAAGCTG 360 

AAGAAGGCCG CCGAGCACGG GGATCTGCCG CTGTCGTTCA GCGTGACGAA CATCCAGCCG 420 

GCGGCCGCCG GTTCGGCCAC CGCCGACGTT TCCGTCTCGG GTCCGAAGCT CTCGTCGCCG 480 

GTCACGCAGA ACGTCACGTT CGTGAATCAA GGCGGCTGGA TGCTGTCACG CGCATCGGCG 540 

ATGGAGTTGC TGCAGGCCGC AGGGNAACTG ATTGGCGGGC CGGNTTCAGC CCGCTGTTCA 600 

GCTACGCCGC CCGCCTGGTG ACGCGTCCAT GTCGAACACT CGCGCGTGTA GCACGGTGCG 660 

GTNTGCGCAG GGNCGCACGC ACCGCCCGGT GCAAGCCGTC dCGAGATAG GTGGTGNCTC 720 

GNCACCAGNG ANCACCCCCN NNTCGNCNNT TCTCGNTGNT GNATGA 766 

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 752 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

ATGCATCACC ATCACCATCA CGATGAAGTC ACGGTAGAGA CGACCTCCGT CTTCCGCGCA 60

GACTTCCTCA GCGAGCTGGA CGCTCCTGCG CAAGCGGGTA CGGAGAGCGC GGTCTCCGGG 120

GTGGAAGGGC TCCCGCCGGG CTCGGCGTTG CTGGTAGTCA AACGAGGCCC CAACGCCGGG 180

TCCCGGTTCC TACTCGACCA AGCCATCACG TCGGCTGGTC GGCATCCCGA CAGCGACATA 240

TTTCTCGACG ACGTGACCGT GAGCCGTCGC CATGCTGAAT TCCGGTTGGA AAACAACGAA 300

TTCAATGTCG TCGATGTCGG GAGTCTCAAC GGCACCTACG TCAACCGCGA GCCCGTGGAT 360
TCGGCGGTGC TGGCGAACGG CGACGAGGTC CAGATCGGCA AGCTCCGGTT GGTGTTCTTG 420
ACCGGACCCA AGCAAGGCGA GGATGACGGG AGTACCGGGG GCCCGTGAGC GCACCCGATA 480
GCCCCGCGCT GGCCGGGATG TCGATCGGGG CGGTCCTCCG ACCTGCTACG ACCGGATTTT 540
CCCTGATGTC CACCATCTCC AAGATTCGAT TCTTGGGAGG CTTGAGGGTC NGGGTGACCC 600
CCCCGCGGGC CTCATTCNGG GGTNTCGGCN GGTTTCACCC CNTACCNACT GCCNCCCGGN 660
TTGCNAATTC NTTCTTCNCT GCCCNNAAAG GGACCNTTAN CTTGCCGCTN GAAANGGTNA 720
TCCNGGGCCC NTCCTNGAAN CCCCNTCCCC CT 752

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CATATGCATC ACCATCACCA TCACACTTCT AACCGCCCAG CGCGTCGGGG GCGTCGAGCA 60
CCACGCGACA CCGGGCCCGA TCGATCTGCT AGCTTGAGTC TGGTCAGGCA TCGTCGTCAG 120
CAGCGCGATG CCCTATGTTT GTCGTCGACT CAGATATCGC GGCAATCCAA TCTCCCGCCT 180
GCGGCCGGCG GTGCTGCAAA CTACTCCCGG AGGAATTTCG ACGTGCGCAT CAAGATCTTC 240

ATGCTGGTCA CGGCTGTCGT TTTGCTCTGT TGTTCGGGTG TGGCCACGGC CGCGCCCAAG 300 

ACCTACTGCG AGGAGTTGAA AGGCACCGAT ACCGGCCAGG CGTGCCAGAT TCAAATGTCC 360 

GACCCGGCCT ACAACATCAA CATCAGCCTG CCCAGTTACT ACCCCGACCA GAAGTCGCTG 420 

GAAAATTACA TCGCCCAGAC GCGCGACAAG TTCCTCAGCG CGGCCACATC GTCCACTCCA 480 

CGCGAAGCCC CCTACGAATT GAATATCACC TCGGCCACAT ACCAGTCCGC GATACCGCCG 540 

CGTGGTACGC AGGCCGTGGT GCTCAMGGTC TACCACAACG CCGGCGGCAC GCACCCAACG 600 

ACCACGTACA AGGCCTTCGA TTGGGACCAG GCCTATCGCA AGCCAATCAC CTATGACACG 660 

CTGTGGCAGG CTGACACCGA TCCGCTGCCA GTCGTCTTCC CCATTGTTGC AAGGTGAACT 720 

GAGCAACGCA GACCGGGACA ACWGGTATCG ATAGCCGCCN AATGCCGGCT TGGAACCCNG 780 

TGAAATTATC ACAACTTCGC AGTCACNAAA NAA 813 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGGTATGAAC ACGGCCGCGT CCGATAACTT CCAGCTGTCC CAGGGTGGGC AGGGATTCGC 60 

CATTCCGATC GGGCAGGCGA TGGCGATCGC GGGCCAGATC CGATCGGGTG GGGGGTCACC 120 

CACCGTTCAT ATCGGGCCTA CCGCCTTCCT CGGCTTGGGT GTTGTCGACA ACAACGGCAA 180 

CGGCGCACGA GTCCAACGCG TGGTCGGGAG CGCTCCGGCG GCAAGTCTCG GCATCTCCAC 240 

CGGCGACGTG ATCACCGCGG TCGACGGCGC TCCGATCAAC TCGGCCACCG CGATGGCGGA 300 

CGCGCTTAAC GGGCATCATC CCGGTGACGT CATCTCGGTG AACTGGCAAA CCAAGTCGGG 360

CGGCACGCGT ACAGGGAACG TGACATTGGC CGAGGGACCC CCGGCCTGAT TTCGTCGYGG 420 

ATACCACCCG CCGGCCGGCC AATTGGA 447 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTCCCACTGC GGTCGCCGAG TATGTCGCCC AGCAAATGTC TGGCAGCCGC CCAACGGAAT 60 

CCGGTGATCC GACGTCGCAG GTTGTCGAAC CCGCCGCCGC GGAAGTATCG GTCCATGCCT 120 

AGCCCGGCGA CGGCGAGCGC CGGAATGGCG CGAGTGAGGA GGCGGGCAAT TTGGCGGGGC 180 

CCGGCGACGG NGAGCGCCGG AATGGCGCGA GTGAGGAGGT GGNCAGTCAT GCCCAGNGTG 240 

ATCCAATCAA CCTGNATTCG GNCTGNGGGN CCATTTGACA ATCGAGGTAG TGAGCGCAAA 300

TGAATGATGG AAAACGGGNG GNGACGTCCG NTGTTCTGGT GGTGNTAGGT GNCTGNCTGG 360

NGTNGNGGNT ATCAGGATGT TCTTCGNCGA AANCTGATGN CGAGGAACAG GGTGTNCCCG 420

NNANNCCNAN GGNGTCCNAN CCCNNNNTCC TCGNCGANAT CANANAGNCG NTTGATGNGA 480

NAAAAGGGTG GANCAGNNNN AANTNGNGGN CCNAANAANC NNNANNGNNG NNAGNTNGNT 540

NNNTNTTNNC ANNNNNNNTG NNGNNGNNCN NNNCAANCNN NTNNNNGNAA NNGGNTTNTT 600

NAAT 604



(2) INFORMATION FOR SEQ ID NO: 6: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

TTGCANGTCG AACCACCTCA CTAAAGGGAA CAAAAGCTNG AGCTCCACCG CGGTGGCGGC 60
CGCTCTAGAA CTAGTGKATM YYYCKGGCTG CAGSAATYCG GYACGAGCAT TAGGACAGTC 120
TAACGGTCCT GTTACGGTGA TCGAATGACC GACGACATCC TGCTGATCGA CACCGACGAA 180
CGGGTGCGAA CCCTCACCCT CAACCGGCCG CAGTCCCGYA ACGCGCTCTC GGCGGCGCTA 240
CGGGATCGGT TTTTCGCGGY GTTGGYCGAC GCCGAGGYCG ACGACGACAT CGACGTCGTC 300
ATCCTCACCG GYGCCGATCC GGTGTTCTGC GCCGGACTGG ACCTCAAGGT AGCTGGCCGG 360
GCAGACCGCG CTGCCGGACA TCTCACCGCG GTGGGCGGCC ATGACCAAGC CGGTGATCGG 420
CGCGATCAAC GGCGGGGGGG TGAGGGGGGG GCTCGAACTG GCGCTGTACT GCGACATCCT 480
GATCGCCTCC GAGCACGCCC GCTTCGNCGA CACCCACGCC CGGGTGGGGC TGCTGCCCAC 540
CTGGGGACTC AGTGTGTGCT TGCCGCAAAA GGTCGGCATC GGNCTGGGCC GGTGGATGAG 600
CCTGACCGGC GACTACCTGT CCGTGACCGA CGC 633

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1362 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGACGACGAC GGCGCCGGAG AGCGGGCGCG AACGGCGATC GACGCGGCCC TGGCCAGAGT 60
CGGCACCACC CAGGAGGGAG TCGAATCATG AAATTTGTCA ACCATATTGA GCCCGTCGCG 120
CCCCGCCGAG CCGGCGGCGC GGTCGCCGAG GTCTATGCCG AGGCCCGCCG CGAGTTCGGC 180
CGGCTGCCCG AGCCGCTCGC CATGCTGTCC CCGGACGAGG GACTGCTCAC CGCCGGCTGG 240
GCGACGTTGC GCGAGACACT GCTGGTGGGC CAGGTGCCGC GTGGCCGCAA GGAAGCCGTC 300
GCCGCCGCCG TCGCGGCCAG CCTGCGCTGC CCCTGGTGCG TCGACGCACA CACCACCATG 360
CTGTACGCGG CAGGCCAAAC CGACACCGCC GCGGCGATCT TGGCCGGCAC AGCACCTGCC 420
GCCGGTGACC CGAACGCGCC GTATGTGGCG TGGGCGGCAG GAACCGGGAC ACCGGCGGGA 480
CCGCCGGCAC CGTTCGGCCC GGATGTCGCC GCCGAATACC TGGGCACCGC GGTGCAATTC 540
CACTTCATCG CACGCCTGGT CCTGGTGCTG CTGGACGAAA [missing] GGGGGGCCCG 600
CGCGCCCAAC AGCTCATGCG CCGCGCCGGT GGACTGGTGT TCGCCCGCAA GGTGCGCGCG 660
GAGCATCGGC CGGGCCGCTC CACCCGCCGG CTCGAGCCGC GAACGCTGCC CGACGATCTG 720
GCATGGGCAA CACCGTCCGA GCCCATAGCA ACCGCGTTCG CCGCGCTCAG CCACCACCTG 780
GACACCGCGC CGCACCTGCC GCCACCGACT CGTCAGGTGG TCAGGCGGGT CGTGGGGTCG 840
TGGCACGGCG AGCCAATGCC GATGAGCAGT CGCTGGACGA ACGAGCACAC CGCCGAGCTG 900
CCCGCCGACC TGCACGCGCC CACCCGTCTT GCCCTGCTGA CCGGCCTGGC CCCGCATCAG 960
GTGACCGACG ACGACGTCGC CGCGGCCCGA TCCCTGCTCG ACACCGATGC GGCGCTGGTT 1020
GGCGCCCTGG CCTGGGCCGC CTTCACCGCC GCGCGGCGCA TCGGCACCTG GATCGGCGCC 1080
GCCGCCGAGG GCCAGGTGTC GCGGCAAAAC CCGACTGGGT GAGTGTGCGC GCCCTGTCGG 1140
TAGGGTGTCA TCGCTGGCCC GAGGGATCTC GCGGCGGCGA ACGGAGGTGG CGACACAGGT 1200
GGAAGCTGCG CCCACTGGCT TGCGCCCCAA CGCCGTCGTG GGCGTTCGGT TGGCCGCACT 1260
GGCCGATCAG GTCGGCGCCG GCCCTTGGCC GAAGGTCCAG CTCAACGTGC CGTCACCGAA 1320
GGACCGGACG GTCACCGGGG GTCACCCTGC GCGCCCAAGG AA 1362



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1458 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GCGACGACCC CGATATGCCG GGCACCGTAG CGAAAGCCGT CGCCGACGCA CTCGGGCGCG 60 

GTATCGCTCC CGTTGAGGAC ATTCAGGACT GCGTGGAGGC CCGGCTGGGG GAAGCCGGTC 120 

TGGATGACGT GGCCCGTGTT TACATCATCT ACCGGCAGCG GCGCGCCGAG CTGCGGACGG 180 

CTAAGGCCTT GCTCGGCGTG CGGGACGAGT TAAAGCTGAG CTTGGCGGCC GTGACGGTAC 240 

TGCGCGAGCG CTATCTGCTG CACGACGAGC AGGGCCGGCC GGCCGAGTCG ACCGGCGAGC 300 

TGATGGACCG ATCGGCGCGC TGTGTCGCGG CGGCCGAGGA CCAGTATGAG CCGGGCTCGT 360 

CGAGGCGGTG GGCCGAGGGG TTCGGCACGC TATTACGCAA CCTGGAATTC CTGCCGAATT 420 

CGCCCACGTT GATGAACTCT GGCACCGACC TGGGACTGCT CGCCGGCTGT TTTGTTCTGC 480 

CGATTGAGGA TTCGCTGCAA TCGATCTTTG CGACGCTGGG ACAGGCCGCC GAGCTGCAGC 540

GGGCTGGAGG CGGCACCGGA TATGCGTTCA GCCACCTGCG ACCCGCCGGG GATCGGGTGG 600

CCTCCACGGG CGGCACGGCC AGCGGACCGG TGTCGTTTCT ACGGCTGTAT GACAGTGCCG 660

CGGGTGTGGT CTCCATGGGC GGTCGCCGGC GTGGCGCCTG TATGGCTGTG CTTGATGTGT 720

CGCACCCGGA TATCTGTGAT TTCGTCACCG CCAAGGCCGA ATCCCCCAGC GAGCTCCCGC 780

ATTTCAACCT ATCGGTTGGT GTGACCGACG CGTTCCTGCG GGCCGTCGAA CGCAACGGCC 840

TACACCGGCT GGTCAATCCG CGAACCGGCA AGATCGTCGC GCGGATGCCC GCCGCCGAGC 900

TGTTCGACGC CATCTGCAAA GCCGCGCACG CCGGTGGCGA TCCCGGGCTG GTGTTTCTCG 960

ACACGATCAA TAGGGCAAAC CCGGTGCCGG GGAGAGGCCG CATCGAGGCG ACCAACCCGT 1020

GCGGGGAGGT CCCACTGCTG CCTTACGAGT CATGTAATCT CGGCTCGATC AACCTCGCCC 1080

GGATGCTCGC CGACGGTCGC GTCGACTGGG ACCGGCTCGA GGAGGTCGCC GGTGTGGCGG 1140

TGCGGTTCCT TGATGACGTC ATCGATGTCA GCCGCTACCC CTTCCCCGAA CTGGGTGAGG 1200

CGGCCCGCGC CACCCGCAAG ATCGGGCTGG GAGTCATGGG TTTGGCGGAA CTGCTTGCCG 1260

CACTGGGTAT TCCGTACGAC AGTGAAGAAG CCGTGCGGTT AGCCACCCGG CTCATGCGTC 1320

GCATACAGCA GGCGGCGCAC ACGGCATCGC GGAGGCTGGC CGAAGAGCGG GGCGCATTCC 1380 

CGGCGTTCAC CGATAGCCGG TTCGCGCGGT CGGGCCCGAG GCGCAACGCA CAGGTCACCT 1440 

CCGTCGCTCC GACGGGCA 1458 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ACGGTGTAAT CGTGCTGGAT CTGGAACCGC GTGGCCCGCT ACCTACCGAG ATCTACTGGC 60 

GGCGCAGGGG GCTGGCCCTG GGCATCGCGG TCGTCGTAGT CGGGATCGCG GTGGCCATCG 120 

TCATCGCCTT CGTCGACAGC AGCGCCGGTG CCAAACCGGT CAGCGCCGAC AAGCCGGCCT 180 

CCGCCCAGAG CCATCCGGGC TCGCCGGCAC CCCAAGCACC CCAGCCGGCC GGGCAAACCG 240 

AAGGTAACGC CGCCGCGGCC CCGCCGCAGG GCCAAAACCC CGAGACACCC ACGCCCACCG 300 

CCGCGGTGCA GCCGCCGCCG GTGCTCAAGG AAGGGGACGA TTGCCCCGAT TCGACGCTGG 360 

CCGTCAAAGG TTTGACCAAC GCGCCGCAGT ACTACGTCGG CGACCAGCCG AAGTTCACCA 420

TGGTGGTCAC CAACATCGGC CTGGTGTCCT GTAAACGCGA CGTTGGGGCC GCGGTGTTGG 480 

CCGCCTACGT TTACTCGCTG GACAACAAGC GGTTGTGGTC CAACCTGGAC TGCGCGCCCT 540 

CGAATGAGAC GCTGGTCAAG ACGTTTTCCC CCGGTGAGCA GGTAACGACC GCGGTGACCT 600 
GGACCGGGAT GGGATCGGCG CCGCGCTGCC CATTGCCGCG GCCGGCGATC GGGCCGGGCA 660

CCTACAATCT CGTGGTACAA CTGGGCAATC TGCGCTCGCT GCCGGTTCCG TTCATCCTGA 720 

ATCAGCCGCC GCCGCCGCCC GGGCCGGTAC CCGCTCCGGG TCCAGCGCAG GCGCCTCCGC 780 

CGGAGTCTCC CGCGCAAGGC GGATAATTAT TGATCGCTGA TGGTCGATTC CGCCAGCTGT 840

GACAACCCCT CGCCTCGTGC CG 862 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TTGATCAGCA CCGGCAAGGC GTCACATGCC TCCCTGGGTG TGCAGGTGAC CAATGACAAA 60 

GACACCCCGG GCGCCAAGAT CGTCGAAGTA GTGGCCGGTG GTGCTGCCGC GAACGCTGGA 120

GTGCCGAAGG GCGTCGTTGT CACCAAGGTC GACGACCGCC CGATCAACAG CGCGGACGCG 180

TTGGTTGCCG CCGTGCGGTC CAAAGCGCCG GGCGCCACGG TGGCGCTAAC CTTTCAGGAT 240

CCCTCGGGCG GTAGCCGCAC AGTGCAAGTC ACCCTCGGCA AGGCGGAGCA GTGATGAAGG 300

TCGCCGCGCA GTGTTCAAAG CTCGGATATA CGGTGGCACC CATGGAACAG CGTGCGGAGT 360

TGGTGGTTGG CCGGGCACTT GTCGTCGTCG TTGACGATCG CACGGCGCAC GGCGATGAAG 420

ACCACAGCGG GCCGCTTGTC ACCGAGCTGC TCACCGAGGC CGGGTTTGTT GTCGACGGCG 480

TGGTGGCGGT GTCGGCCGAC GAGGTCGAGA TCCGAAATGC GCTGAACACA GCGGTGATCG 540

GCGGGGTGGA CCTGGTGGTG TCGGTCGGCG GGACCGGNGT GACGNCTCGC GATGTCACCC 600

CGGAAGCCAC CCGNGACATT CT 622

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1200 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCGCAGCGG TAAGCCTGTT GGCCGCCGGC ACACTGGTGT TGACAGCATG CGGCGGTGGC 60

ACCAACAGCT CGTCGTCAGG CGCAGGCGGA ACGTCTGGGT CGGTGCACTG CGGCGGCAAG 120 

AAGGAGCTCC ACTCCAGCGG CTCGACCGCA CAAGAAAATG CCATGGAGCA GTTCGTCTAT 180 

GCCTACGTGC GATCGTGCCC GGGCTACACG TTGGACTACA ACGCCAACGG GTCCGGTGCC 240 

GGGGTGACCC AGTTTCTCAA CAACGAAACC GATTTCGCCG GCTCGGATGT CCCGTTGAAT 300 

CCGTCGACCG GTCAACCTGA CCGGTCGGCG GAGCGGTGCG GTTCCCCGGC ATGGGACCTG 360 

CCGACGGTGT TCGGCCCGAT CGCGATCACC TACAATATCA AGGGCGTGAG CACGCTGAAT 420 

CTTGACGGAC CCACTACCGC CAAGATTTTC AACGGCACCA TCACCGTGTG GAATGATCCA 480 

CAGATCCAAG CCCTCAACTC CGGCACCGAC CTGCCGCCAA CACCGATTAG CGTTATCTTC 540 

CGCAGCGACA AGTCCGGTAC GTCGGACAAC TTCCAGAAAT ACCTCGACGG TGTATCCAAC 600

GGGGCGTGGG GCAAAGGCGC CAGCGAAACG TTCAGCGGGG GCGTCGGCGT CGGCGCCAGC 660 

GGGAACAACG GAACGTCGGC CCTACTGCAG ACGACCGACG GGTCGATCAC CTACAACGAG 720 

TGGTCGTTTG CGGTGGGTAA GCAGTTGAAC ATGGCCCAGA TCATCACGTC GGCGGGTCCG 780 

GATCCAGTGG CGATCACCAC CGAGTCGGTC GGTAAGACAA TCGCCGGGGC CAAGATCATG 840 

GGACAAGGCA ACGACCTGGT ATTGGACACG TCGTCGTTCT ACAGACCCAC CCAGCCTGGC 900 

TCTTACCCGA TCGTGCTGGC GACCTATGAG ATCGTCTGCT CGAAATACCC GGATGCGACG 960 

ACCGGTACTG CGGTAAGGGC GTTTATGCAA GCCGCGATTG GTCCAGGCCA AGAAGGCCTG 1020 
GACCAATACG GCTCCATTCC GTTGGCCAAA TCGTTCCAAG CAAAATTGGC GGCCGCGGTG 1080

AATGCTATTT CTTGACCTAG TGAAGGGAAT TCGACGGTGA GCGATGCCGT TCCGCAGGTA 1140

GGGTCGCAAT TTGGGCCGTA TCAGCTATTG CGGCTGCTGG GCCGAGGCGG GATGGGCGAG 1200 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GCAAGCAGCT GCAGGTCGTG CTGTTCGACG AACTGGGCAT GCCGAAGACC AAACGCACCA 60

AGACCGGGTA GAGGACGGAT GCCGACGCGC TGCAGTCGTT GTTCGACAAG ACCGGGCATC 120

CGTTTCTGCA ACATCTGCTC GCCCACCGCG ACGTCACCCG GCTCAAGGTC ACCGTCGACG 180

GGTTGCTCCA AGCGGTGGCC GCCGACGGCC GCATCCACAC CACGTTCAAC CAGACGATCG 240

CCGCGACCGG CCGGCTCTCC TCGACCGAAC CCAACCTGCA GAACATCCCG ATCCGCACCG 300

ACGCGGGCCG GCGGATCCGG GACGCGTTCG TGGTCGGGGA CGGTTACGCC GAGTTGATGA 360

CGGCCGACTA CAGCCAGATC GAGATGCGGA TCATGGGGCA CCTGTCCGGG GACGAGGGCC 420

TCATCGAGGC GTTCAACACC GGGGAGGACC TGTATTCGTT CGTCGCGTCC CGGGTGTTCG 480

GTGTGCCCAT CGACGAGGTC ACCGGCGAGT TGCGGCGCCG GGTCAAGGCG ATGTCCTACG 540

GGCTGGTTTA CGGGTTGAGC GCCTACGGCC TGTCGCAGCA GTTGAAAATC TCCACCGAGG 600

AAGCCAACGA GCAGATGGAC GCGTATTTCG CCCGATTCGG CGGGGTGCGC GACTACCTGC 660

GCGCCGTAGT CGAGCGGGCC CGCAAGGACG GCTACACCTC GACGGTGCTG GGCCGTCGCC 720

GCTACCTGCC CGAGCTGGAC AGCAGCAACC GTCAAGTGCG GGAGGCCGCC GAGCGGGCGG 780

CGCTGAACGC GCCGATCCAG GGCAGCGCGG CCGACATCAT CAAGGTGGCC ATGATCCAGG 840

TCGACAAGGC GCTCAACGAG GCACAGCTGG CGTCGCGCAT GCTGCTGCAG GTCCACGACG 900

AGCTGCTGTT CGAAATCGCC CCCGGTGAAC GCGAGCGGGT CGAGGCCCTG GTGCGCGACA 960

AGATGGGCGG CGCTTACCCG CTCGACGTCC CGCTGGAGGT GTCGGTGGGC TACGGCCGCA 1020

GCTGGGACGC GGCGGCGCAC TGAGTGCCGA GCGTGCATCT GGGGCGGGAA TTCGGCGATT 1080

TTTCCGCCCT GAGTTCACGC TCGGCGCAAT CGGGACCGAG TTTGTCCAGC GTGTACCCGT 1140

CGAGTAGCCT CGTCA 1155

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1771 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GAGCGCCGTC TGGTGTTTGA ACGGTTTTAC CGGTCGGCAT CGGCACGGGC GTTGCCGGGT 60

TCGGGCCTCG GGTTGGCGAT CGTCAAACAG GTGGTGCTCA ACCACGGCGG ATTGCTGCGC 120

ATCGAAGACA CCGACCCAGG CGGCCAGCCC CCTGGAACGT CGATTTACGT GCTGCTCCCC 180

GGCCGTCGGA TGCCGATTCC GCAGCTTCCC GGTGCGACGG CTGGCGCTCG GAGCACGGAC 240

ATCGAGAACT CTCGGGGTTC GGCGAACGTT ATCTCAGTGG AATCTCAGTC CACGCGCGCA 300

ACCTAGTTGT GCAGTTACTG TTGAAAGCCA CACCCATGCC AGTCCACGCA TGGCCAAGTT 360

GGCCCGAGTA GTGGGCCTAG TACAGGAAGA GCAACCTAGC GACATGACGA ATCACCCACG 420

GTATTCGCCA CCGCCGCAGC AGCCGGGAAC CCCAGGTTAT GCTCAGGGGC AGCAGCAAAC 480

GTACAGCCAG CAGTTCGACT GGCGTTACCC ACCGTCCCCG CCCCCGCAGC CAACCCAGTA 540

CCGTCAACCC TACGAGGCGT TGGGTGGTAC CCGGCCGGGT CTGATACCTG GCGTGATTCC 600

GACCATGACG CCCCCTCCTG GGATGGTTCG CCAACGCCCT CGTGCAGGCA TGTTGGCCAT 660

CGGCGCGGTG ACGATAGCGG TGGTGTCCGC CGGCATCGGC GGCGCGGCCG CATCCCTGGT 720 

CGGGTTCAAC CGGGCACCCG CCGGCCCCAG CGGCGGCCCA GTGGCTGCCA GCGCGGCGCC 780 

AAGCATCCCC GCAGCAAACA TGCCGCCGGG GTCGGTCGAA CAGGTGGCGG CCAAGGTGGT 840 

GCCCAGTGTC GTCATGTTGG AAACCGATCT GGGCCGCCAG TCGGAGGAGG GCTCCGGCAT 900 

CATTCTGTCT GCCGAGGGGC TGATCTTGAC CAACAACCAC GTGATCGCGG CGGCCGCCAA 960 

GCCTCCCCTG GGCAGTCCGC CGCCGAAAAC GACGGTAACC TTCTCTGACG GGCGGACCGC 1020 

ACCCTTCACG GTGGTGGGGG CTGACCCCAC CAGTGATATC GCCGTCGTCC GTGTTCAGGG 1080 

CGTCTCCGGG CTCACCCCGA TCTCCCTGGG TTCCTCCTCG GACCTGAGGG TCGGTCAGCC 1140 

GGTGCTGGCG ATCGGGTCGC CGCTCGGTTT GGAGGGCACC GTGACCACGG GGATCGTCAG 1200 

CGCTCTCAAC CGTCCAGTGT CGACGACCGG CGAGGCCGGC AACCAGAACA CCGTGCTGGA 1260 

CGCCATTCAG ACCGACGCCG CGATCAACCC CGGTAACTCC GGGGGCGCGC TGGTGAACAT 1320 

GAACGCTCAA CTCGTCGGAG TCAACTCGGC CATTGCCACG CTGGGCGCGG ACTCAGCCGA 1380 

TGCGCAGAGC GGCTCGATCG GTCTCGGTTT TGCGATTCCA GTCGACCAGG CCAAGCGCAT 1440

CGCCGACGAG TTGATCAGCA CCGGCAAGGC GTCACATGCC TCCCTGGGTG TGCAGGTGAC 1500

CAATGACAAA GACACCCCGG GCGCCAAGAT CGTCGAAGTA GTGGCCGGTG GTGCTGCCGC 1560

GAACGCTGGA GTGCCGAAGG GCGTCGTTGT CACCAAGGTC GACGACCGCC CGATCAACAG 1620

CGCGGACGCG TTGGTTGCCG CCGTGCGGTC CAAAGCGCCG GGCGCCACGG TGGCGCTAAC 1680

CTTTCAGGAT CCCTCGGGCG GTAGCCGCAC AGTGCAAGTC ACCCTCGGCA AGGCGGAGCA 1740

GTGATGAAGG TCGCCGCGCA GTGTTCAAAG C 1771

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1058 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:



CTCCACCGCG GTGGCGGCCG CTCTAGAACT AGTGGATCCC CCGGGCTGCA GGAATTCGGC 60 

ACGAGGATCC GACGTCGCAG GTTGTCGAAC CCGCCGCCGC GGAAGTATCG GTCCATGCCT 120 

AGCCCGGCGA CGGCGAGCGC CGGAATGGCG CGAGTGAGGA GGCGGGCAAT TTGGCGGGGC 180 

CCGGCGACGG CGAGCGCCGG AATGGCGCGA GTGAGGAGGC GGGCAGTCAT GCCCAGCGTG 240 

ATCCAATCAA CCTGCATTCG GCCTGCGGGC CCATTTGACA ATCGAGGTAG TGAGCGCAAA 300 

TGAATGATGG AAAACGGGCG GTGACGTCCG CTGTTCTGGT GGTGCTAGGT GCCTGCCTGG 360 

CGTTGTGGCT ATCAGGATGT TCTTCGCCGA AACCTGATGC CGAGGAACAG GGTGTTCCCG 420 

TGAGCCCGAC GGCGTCCGAC CCCGCGCTCC TCGCCGAGAT CAGGCAGTCG CTTGATGCGA 480 

CAAAAGGGTT GACCAGCGTG CACGTAGCGG TCCGAACAAC CGGGAAAGTC GACAGCTTGC 540 

TGGGTATTAC CAGTGCCGAT GTCGACGTCC GGGCCAATCC GCTCGCGGCA AAGGGCGTAT 600

GCACCTACAA CGACGAGCAG GGTGTCCCGT TTCGGGTACA AGGCGACAAC ATCTCGGTGA 660

AACTGTTCGA CGACTGGAGC AATCTCGGCT CGATTTCTGA ACTGTCAACT TCACGCGTGC 720 

TCGATCCTGC CGCTGGGGTG ACGCAGCTGC TGTCCGGTGT CACGAACCTC CAAGCGCAAG 780 

GTACCGAAGT GATAGACGGA ATTTCGACCA CCAAAATCAC CGGGACCATC CCCGCGAGCT 840 

CTGTCAAGAT GCTTGATCCT GGCGCCAAGA GTGCAAGGCC GGCGACCGTG TGGATTGCCC 900 

AGGACGGCTC GCACCACCTC GTCCGAGCGA GCATCGACCT CGGATCCGGG TCGATTCAGC 960 

TCACGCAGTC GAAATGGAAC GAACCCGTCA ACGTCGACTA GGCCGAAGTT GCGTCGACGC 1020 

GTTGNTCGAA ACGCCCTTGT GAACGGTGTC AACGGNAC 1058 

(2) INFORMATION FOR SEQ ID NO: 15: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAATTCGGCA CGAGAGGTGA TCGACATCAT CGGGACCAGC CCCACATCCT GGGAACAGGC 60 

GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA TAGCGTCGAT GACATCCGCG TCGCTCGGGT 120 

CATTGAGCAG GACATGGCCG TGGACAGCGC CGGCAAGATC ACCTACCGCA TCAAGCTCGA 180 

AGTGTCGTTC AAGATGAGGC CGGCGCAACC GCGCTAGCAC GGGCCGGCGA GCAAGACGCA 240 

AAATCGCACG GTTTGCGGTT GATTCGTGCG ATTTTGTGTC TGCTCGCCGA GGCCTACCAG 300 

GCGCGGCCCA GGTCCGCGTG CTGCCGTATC CAGGCGTGCA TCGCGATTCC GGCGGCCACG 360 

CCGGAGTTAA TGCTTCGCGT CGACCCGAAC TGGGCGATCC GCCGGNGAGC TGATCGATGA 420 

CCGTGGCCAG CCCGTCGATG CCCGAGTTGC CCGAGGAAAC GTGCTGCCAG GCCGGTAGGA 480 

AGCGTCCGTA GGCGGCGGTG CTGACCGGCT CTGCCTGCGC CCTCAGTGCG GCCAGCGAGC 540 

GG 542 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 913 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 



CGGTGCCGCC CGCGCCTCCG TTGCCCCCAT TGCCGCCGTC GCCGATCAGC TGCGCATCGC 60

CACCATCACC GCCTTTGCCG CCGGCACCGC CGGTGGCGCC GGGGCCGCCG ATGCCACCGC 120

TTGACCCTGG CCGCCGGCGC CGCCATTGCC ATACAGCACC CCGCCGGGGG CACCGTTACC 180

GCCGTCGCCA CCGTCGCCGC CGCTGCCGTT TCAGGCCGGG GAGGCCGAAT GAACCGCCGC 240

CAAGCCCGCC GCCGGCACCG TTGCCGCCTT TTCCGCCCGC CCCGCCGGCG CCGCCAATTG 300

CCGAACAGCC AMGCACCGTT GCCGCCAGCC CCGCCGCCGT TAACGGCGCT GCCGGGCGCC 360

GCCGCCGGAC CCGCCATTAC CGCCGTTCCC GTTCGGTGCC CCGCCGTTAC CGGCGCCGCC 420

GTTTGCCGCC AATATTCGGC GGGCACCGCC AGACCCGCCG GGGCCACCAT TGCCGCCGGG 480

CACCGAAACA ACAGCCCAAC GGTGCCGCCG GCCCCGCCGT TTGCCGCCAT CACCGGCCAT 540 

TCACCGCCAG CACCGCCGTT AATGTTTATG AACCCGGTAC CGCCAGCGCG GCCCCTATTG 600 

CCGGGCGCCG GAGNGCGTGC CCGCCGGCGC CGCCAACGCC CAAAAGCCCG GGGTTGCCAC 660 

CGGCCCCGCC GGACCCACCG GTCCCGCCGA TCCCCCCGTT GCCGCCGGTG CCGCCGCCAT 720 

TGGTGCTGCT GAAGCCGTTA GCGCCGGTTC CGCSGGTTCC GGCGGTGGCG CCNTGGCCGC 780 

CGGCCCCGCC GTTGCCGTAC AGCCACCCCC CGGTGGCGCC GTTGCCGCCA TTGCCGCCAT 840

TGCCGCCGTT GCCGCCATTG CCGCCGTTCC CGCCGCCACC GCCGGNTTGG CCGCCGGCGC 900 

CGCCGGCGGC CGC 913 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



GACTACGTTG GTGTAGAAAA ATCCTGCCGC CCGGACCCTT AAGGCTGGGA CAATTTCTGA 60
TAGCTACCCC GACACAGGAG GTTACGGGAT GAGCAATTCG CGCCGCCGCT CACTCAGGTG 120
GTCATGGTTG CTGAGCGTGC TGGCTGCCGT CGGGCTGGGC CTGGCCACGG CGCCGGCCCA 180
GGCGGCCCCG CCGGCCTTGT CGCAGGACCG GTTCGCCGAC TTCCCCGCGC TGCCCCTCGA 240
CCCGTCCGCG ATGGTCGCCC AAGTGGCGCC ACAGGTGGTC AACATCAACA CCAAACTGGG 300
CTACAACAAC GCCGTGGGCG CCGGGACCGG CATCGTCATC GATCCCAACG GTGTCGTGCT 360
GACCAACAAC CACGTGATCG CGGGCGCCAC CGACATCAAT GCGTTCAGCG TCGGCTCCGG 420
CCAAACCTAC GGCGTCGATG TGGTCGGGTA TGACCGCACC CAGGATGTCG CGGTGCTGCA 480
GCTGCGCGGT GCCGGTGGCC TGCCGTCGGC GGCGATCGGT GGCGGCGTCG CGGTTGGTGA 540
GCCCGTCGTC GCGATGGGCA ACAGCGGTGG GCAGGGCGGA ACGCCCCGTG CGGTGCCTGG 600
CAGGGTGGTC GCGCTCGGCC AAACCGTGCA GGCGTCGGAT TCGCTGACCG GTGCCGAAGA 660
GACATTGAAC GGGTTGATCC AGTTCGATGC CGCAATCCAG CCCGGTGATT CGGGCGGGCC 720
CGTCGTCAAC GGCCTAGGAC AGGTGGTCGG TATGAACACG GCCGCGTCCG ATAACTTCCA 780
GCTGTCCCAG GGTGGGCAGG GATTCGCCAT TCCGATCGGG CAGGCGATGG CGATCGCGGG 840
CCAAATCCGA TCGGGTGGGG GGTCACCCAC CGTTCATATC GGGCCTACCG CCTTCCTCGG 900
CTTGGGTGTT GTCGACAACA ACGGCAACGG CGCACGAGTC CAACGCGTGG TCGGAAGCGC 960
TCCGGCGGCA AGTCTCGGCA TCTCCACCGG CGACGTGATC ACCGCGGTCG ACGGCGCTCC 1020
GATGAAGTGG [missing] TGGCGGACGC GCTTAACGGG CATCATCCCG GTGACGTCAT 1080
CTCGGTGAAC TGGCAAACCA AGTCGGGCGG CACGCGTACA GGGAACGTGA CATTGGCCGA 1140
GGGACCCCCG GCCTGATTTG TCGCGGATAC CACCCGCCGG CCGGCCAATT GGATTGGCGC 1200
CAGCCGTGAT TGCCGCGTGA GCCCCCGAGT TCCGTCTCCC GTGCGCGTGG CATTGTGGAA 1260
GCAATGAACG AGGCAGAACA CAGCGTTGAG CACCCTCCCG TGCAGGGCAG TTACGTCGAA 1320
GGCGGTGTGG TCGAGCATCC GGATGCCAAG GACTTCGGCA GCGCCGCCGC CCTGCCCGCC 1380
GATCCGACCT GGTTTAAGCA CGCCGTCTTC TACGAGGTGC TGGTCCGGGC GTTCTTCGAC 1440
GCCAGCGCGG ACGGTTCCGN CGATCTGCGT GGACTCATCG ATCGCCTCGA CTACCTGCAG 1500
TGGCTTGGCA TCGACTGCAT CTGTTGCCGC CGTTCCTACG ACTCACCGCT GCGCGACGGC 1560
GGTTACGACA TTCGCGACTT CTACAAGGTG CTGCCCGAAT TCGGCACCGT CGACGATTTC 1620
GTCGCCCTGG TCGACACCGC TCACCGGCGA GGTATCCGCA TCATCACCGA CCTGGTGATG 1680
AATCACACCT CGGAGTCGCA CCCCTGGTTT CAGGAGTCCC GCCGCGACCC AGACGGACCG 1740
TACGGTGACT ATTACGTGTG GAGCGACACC AGCGAGCGCT ACACCGACGC CCGGATCATC 1800
TTCGTCGACA CCGAAGAGTC GAACTGGTCA TTCGATCCTG TCCGCCGACA GTTNCTACTG 1860
GCACCGATTC TT 1872



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



CTTCGCCGAA ACCTGATGCC GAGGAACAGG GTGTTCCCGT GAGCCCGACG GCGTCCGACC 60
CCGCGCTCCT CGCCGAGATC AGGCAGTCGC TTGATGCGAC AAAAGGGTTG ACCAGCGTGC 120
ACGTAGCGGT CCGAACAACC GGGAAAGTCG ACAGCTTGCT GGGTATTACC AGTGCCGATG 180
TCGACGTCCG GGCCAATCCG CTCGCGGCAA AGGGCGTATG CACCTACAAC GACGAGCAGG 240
GTGTCCCGTT TCGGGTACAA GGCGACAACA TCTCGGTGAA ACTGTTCGAC GACTGGAGCA 300
ATCTCGGCTC GATTTCTGAA CTGTCAACTT CACGCGTGCT CGATCCTGCC GCTGGGGTGA 360
CGCAGCTGCT GTCCGGTGTC ACGAACCTCC AAGCGCAAGG TACCGAAGTG ATAGACGGAA 420
TTTCGACCAC CAAAATCACC GGGACCATCC CCGCGAGCTC TGTCAAGATG CTTGATCCTG 480
GCGCCAAGAG TGCAAGGCCG GCGACCGTGT GGATTGCCCA GGACGGCTCG CACCACCTCG 540
TCCGAGCGAG CATCGACCTC GGATCCGGGT CGATTCAGCT CACGCAGTCG AAATGGAACG 600
AACCCGTCAA CGTCGACTAG GCCGAAGTTG CGTCGACGCG TTGCTCGAAA CGCCCTTGTG 660
AACGGTGTCA ACGGCACCCG AAAACTGACC CCCTGACGGC ATCTGAAAAT TGACCCCCTA 720
GACCGGGCGG TTGGTGGTTA TTCTTCGGTG GTTCCGGCTG GTGGGACGCG GCCGAGGTCG 780
CGGTCTTTGA GCCGGTAGCT GTCGCCTTTG AGGGCGACGA CTTCAGCATG GTGGACGAGG 840
CGGTCGATCA TGGCGGCAGC AACGACGTCG TCGCCGCCGA AAACCTCGCC CCACCGGCCG 900

AAGGCCTTAT TGGACGTGAC GATCAAGCTG GCCCGCTCAT ACCGGGAGGA CACCAGCTGG 960 

AAGAAGAGGT TGGCGGCCTC GGGCTCAAAC GGAATGTAAC CGACTTCGTC AACCACCAGG 1020 

AGCGGATAGC GGCCAAACCG GGTGAGTTCG GCGTAGATGC GCCCGGCGTG GTGAGCCTCG 1080 

GCGAACCGTG CTACCCATTC GGCGGCGGTG GCGAACAGCA CCCGATGACC GGCCTGACAC 1140 

GCGCGTATCG CCAGGCCGAC CGCAAGATGA GTCTTCCCGG TGCCAGGCC3G GGCCCAAAAA 1200 

CACGACGTTA TCGCGGGCGG TGATGAAATC CAGGGTGCCC AGATGTGCGA TGGTGTCGCG 1260 

TTTGAGGCCA CGAGCATGCT CAAAGTCGAA CTCTTCCAAC GACTTCCGAA CCGGGAAGCG 1320 

GGCGGCGCGG ATGCGGCCCT CACCACCATG GGACTCCCGG GCTGACACTT CCCGCTGCAG 1380 

GCAGGCGGCC AGGTATTCTT CGTGGCTCCA GTTCTCGGCG CGGGCGCGAT CGGCCAGCCG 1440 

GGACACTGAC TCACGCAGGG TGGGAGCTTT CAATGCTCTT GT 1482 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

GAATTCGGCA CGAGCCGGCG ATAGCTTCTG GGCCGCGGCC GACCAGATGG CTCGAGGGTT 60

CGTGCTCGGG GCCACCGCCG GGCGCACCAC CCTGACCGGT GAGGGCCTGC AACACGCCGA 120

CGGTCACTCG TTGCTGCTGG ACGCCACCAA CCCGGCGGTG GTTGCCTACG ACCCGGCCTT 180

CGCCTACGAA ATCGGCTACA TCGNGGAAAG CGGACTGGCC AGGATGTGCG GGGAGAACCC 240

GGAGAACATC TTCTTCTACA TCACCGTCTA CAACGAGCCG TACGTGCAGC CGCCGGAGCC 300

GGAGAACTTC GATCCCGAGG GCGTGCTGGG GGGTATCTAC CGNTATCACG CGGCCACCGA 360

GCAACGCACC AACAAGGNGC AGATCCTGGC CTCCGGGGTA GCGATGCCCG CGGCGCTGCG 420

GGCAGCACAG ATGCTGGCCG CCGAGTGGGA TGTCGCCGCC GACGTGTGGT CGGTGACCAG 480

TTGGGGCGAG CTAAACCGCG ACGGGGTGGT CATCGAGACC GAGAAGCTCC GCCACCCCGA 540

TCGGCCGGCG GGCGTGCCCT ACGTGACGAG AGCGCTGGAG AATGCTCGGG GCCCGGTGAT 600

CGCGGTGTCG GACTGGATGC GCGCGGTCCC CGAGCAGATC CGACCGTGGG TGCCGGGCAC 660 

ATACCTCACG TTGGGCACCG ACGGGTTCGG TTTTTCCGAC ACTCGGCCCG CCGGTCGTCG 720 

TTACTTCAAC ACCGACGCCG AATCCCAGGT TGGTCGCGGT TTTGGGAGGG GTTGGCCGGG 780 

TCGACGGGTG AATATCGACC CATTCGGTGC CGGTCGTGGG CCGCCCGCCC AGTTACCCGG 840 

ATTCGACGAA GGTGGGGGGT TGCGCCCGAN TAAGTT 876 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATCCCCCCGG GCTGCAGGAA TTCGGCACGA GAGACAAAAT TCCACGCGTT AATGCAGGAA 60 

CAGATTCATA ACGAATTCAC AGCGGCACAA CAATATGTCG CGATCGCGGT TTATTTCGAC 120 

AGCGAAGACC TGCCGCAGTT GGCGAAGCAT TTTTACAGCC AAGCGGTCGA GGAACGAAAC 180

CATGCAATGA TGCTCGTGCA ACACCTGCTC GACCGCGACC TTCGTGTCGA AATTCCCGGC 240 

GTAGACACGG TGCGAAACCA GTTCGACAGA CCCCGCGAGG CACTGGCGCT GGCGCTCGAT 300 

CAGGAACGCA CAGTCACCGA CCAGGTCGGT CGGCTGACAG CGGTGGCCCG CGACGAGGGC 360 

GATTTCCTCG GCGAGCAGTT CATGCAGTGG TTCTTGCAGG AACAGATCGA AGAGGTGGCC 420

TTGATGGCAA CCCTGGTGCG GGTTGCCGAT CGGGCCGGGG CCAACCTGTT CGAGCTAGAG 480

AACTTCGTCG CACGTGAAGT GGATGTGGCG CCGGCCGCAT CAGGCGCCCC GCACGCTGCC 540 

GGGGGCCGCC TCTAGATCCC TGGGGGGGAT CAGCGAGTGG TCCCGTTCGC CCGCCCGTCT 600 

TCCAGCCAGG CCTTGGTGCG GCCGGGGTGG TGAGTACCAA TCCAGGCCAC CCCGACCTCC 660 

CGGNAAAAGT CGATGTCCTC GTACTCATCG ACGTTCCAGG AGTACACCGC CCGGCCCTGA 720 

GCTGCCGAGC GGTCAACGAG TTGCGGATAT TCCTTTAACG CAGGCAGTGA GGGTCCCACG 780 

GCGGTTGGCC CGACCGCCGT GGCCGCACTG CTGGTCAGGT ATCGGGGGGT CTTGGCGAGC 840 

AACAACGTCG GCAGGAGGGG TGGAGCCCGC CGGATCCGCA GACCGGGGGG GCGAAAACGA 900 

CATCAACACC GCACGGGATC GATCTGCGGA GGGGGGTGCG GGAATACCGA ACCGGTGTAG 960 

GAGCGCCAGC AGTTGTTTTT CCACCAGCGA AGCGTTTTCG GGTCATCGGN GGCNNTTAAG 1020 

T 1021 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

CGTGCCGACG AACGGAAGAA CACAACCATG AAGATGGTGA AATCGATCGC CGCAGGTCTG 60

ACCGCCGCGG CTGCAATCGG CGCCGCTGCG GCCGGTGTGA CTTCGATCAT GGCTGGCGGN 120

CCGGTCGTAT ACCAGATGCA GCCGGTCGTC TTCGGCGCGC CACTGCCGTT GGACCCGGNA 180

TCCGCCCCTG ANGTCCCGAC CGCCGCCCAG TGGACCAGNC TGCTCAACAG NCTCGNCGAT 240

CCCAACGTGT CGTTTGNGAA CAAGGGNAGT CTGGTCGAGG GNGGNATCGG NGGNANCGAG 300

GGNGNGNATC GNCGANCACA A 321

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

TCTTATCGGT TCCGGTTGGC GACGGGTTTT GGGNGCGGGT GGTTAACCCG CTCGGCCAGC 60 

CGATCGACGG GCGCGGAGAC GTCGACTCCG ATACTCGGCG CGCGCTGGAG CTCCAGGCGC 120 

CCTCGGTGGT GNACCGGCAA GGCGTGAAGG AGCCGTTGNA GACCGGGATC AAGGCGATTG 180 

ACGCGATGAC CCCGATCGGC CGCGGGCAGC GCCAGCTGAT CATCGGGGAC CGCAAGACCG 240 

GCAAAAACCG CCGTCTGTGT CGGACACCAT CCTCAAACCA GCGGGAAGAA CTGGGAGTCC 300

GGTGGATCCC AAGAAGCAGG TGCGCTTGTG TATACGTTGG CCATCGGGCA AGAAGGGGAA 360 

CTTACCATCG CCG 373 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:



GTGACGCCGT GATGGGATTC CTGGGCGGGG CCGGTCCGCT GGCGGTGGTG GATCAGCAAC 60 

TGGTTACCCG GGTGCCGCAA GGCTGGTCGT TTGCTCAGGC AGCCGCTGTG CCGGTGGTGT 120 

TCTTGACGGC CTGGTACGGG TTGGCCGATT TAGCCGAGAT CAAGGCGGGC GAATCGGTGC 180 

TGATCCATGC CGGTACCGGC GGTGTGGGCA TGGCGGCTGT GCAGCTGGQT CGCCAGTGGG 240

GCGTGGAGGT TTTCGTCACC GCCAGCCGTG GNAAGTGGGA CACGCTGCGC GCCATNGNGT 300 

TTGACGACGA NCCATATCGG NGATTCCCNC ACATNCGAAG TTCCGANGGA GA 352



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



GAAATCCGCG TTCATTCCGT TCGACCAGCG GCTGGCGATA ATCGACGAAG TGATCAAGCC 60

GCGGTTCGCG GCGCTCATGG GTCACAGCGA GTAATCAGCA AGTTCTCTGG TATATCGCAC 120

CTAGCGTCCA GTTGCTTGCC AGATCGCTTT CGTACCGTCA TCGCATGTAC CGGTTCGCGT 180

GCCGCACGCT CATGCTGGCG GCGTGCATCC TGGCCACGGG TGTGGCGGGT CTCGGGGTCG 240

GCGCGCAGTC CGCAGCCCAA ACCGCGCCGG TGCCCGACTA CTACTGGTGC CCGGGGCAGC 300

CTTTCGACCC CGCATGGGGG CCCAACTGGG ATCCCTACAC CTGCCATGAC GACTTCCACC 360

GCGACAGCGA CGGCCCCGAC CACAGCCGCG ACTACCCCGG ACCCATCCTC GAAGGTCCCG 420

TGCTTGACGA TCCCGGTGCT GCGCCGCCGC CCCCGGCTGC CGGTGGCGGC GCATAGCGCT 480

CGTTGACCGG GCCGCATCAG CGAATACGCG TATAAACCCG GGCGTGCCCC CGGCAAGCTA 540

CGACCCCCGG CGGGGCAGAT TTACGCTCCC GTGCCGATGG ATCGCGCCGT CCGATGACAG 600

AAAATAGGCG ACGGTTTTGG CAACCGCTTG GAGGACGCTT GAAGGGAACC TGTCATGAAC 660

GGCGACAGCG CCTCCACCAT CGACATCGAC AAGGTTGTTA CCCGCACACC CGTTCGCCGG 720

ATCGTG 726



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



CGCGACGACG ACGAACGTCG GGCCCACCAC CGCCTATGCG TTGATGCAGG CGACCGGGAT 60

GGTCGCCGAC CATATCCAAG CATGCTGGGT GCCCACTGAG CGACCTTTTG ACCAGCCGGG 120 

CTGCCCGATG GCGGCCCGGT GAAGTCATTG CGCCGGGGCT TGTGCACCTG ATGAACCCGA 180 

ATAGGGAACA ATAGGGGGGT GATTTGGCAG TTCAATGTCG GGTATGGCTG GAAATCCAAT 240

GGCGGGGCAT GCTCGGCGCC GACCAGGCTC GCGCAGGCGG GCCAGCCCGA ATCTGGAGGG 300 

AGCACTCAAT GGCGGCGATG AAGCCCCGGA CCGGCGACGG TCCTTTGGAA GCAACTAAGG 360 

AGGGGCGCGG CATTGTGATG CGAGTACCAC TTGAGGGTGG CGGTCGCCTG GTCGTCGAGC 420 

TGACACCCGA CGAAGCCGCC GCACTGGGTG ACGAACTCAA AGGCGTTACT AGCTAAGACC 480 

AGCCCAACGG CGAATGGTCG GCGTTACGCG CACACCTTCC GGTAGATGTC CAGTGTCTGC 540

TCGGCGATGT ATGCCCAGGA GAACTCTTGG ATACAGCGCT 580 



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

AACGGAGGCG CCGGGGGTTT TGGCGGGGCC GGGGCGGTCG GCGGCAACGG CGGGGCCGGC 60 
GGTACCGCCG GGTTGTTCGG TGTCGGCGGG GCCGGTGGGG CCGGAGGCAA CGGCATCGCC 120 
GGTGTCACGG GTACGTCGGC CAGCACACCG GGTGGATCCG 160 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 272 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:

GACACCGATA CGATGGTGAT GTACGCCAAC GTTGTCGACA CGCTCGAGGC GTTCACGATC 60

CAGCGCACAC CCGACGGCGT GACCATCGGC GATGCGGCCC CGTTCGCGGA GGCGGCTGCC 120

AAGGCGATGG GAATCGACAA GCTGCGGGTA ATTCATACCG GAATGGACCC CGTCGTCGCT 180

GAACGCGAAC AGTGGGACGA CGGCAACAAC ACGTTGGCGT TGGCGCCCGG TGTCGTTGTC 240

GCCTACGAGC GCAACGTACA GACCAACGCC CG 272

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



GCAGCCGGTG GTTCTCGGAC TATCTGCGCA CGGTGACGCA GCGCGACGTG CGCGAGCTGA 60 

AGCGGATCGA GCAGACGGAT CGCCTGCCGC GGTTCATGCG CTACCTGGCC GCTATCACCG 120 

CGCAGGAGCT GAACGTGGCC GAAGCGGCGC GGGTCATCGG GGTCGACGCG GGGACGATCC 180 

GTTCGGATCT GGCGTGGTTC GAGACGGTCT ATCTGGTACA TCGCCTGCCC GCCTGGTCGC 240 

GGAATCTGAC CGCGAAGATC AAGAAGCGGT CAAAGATCCA CGTCGTCGAC AGTGGCTTCG 300 

CGGCCTGGTT GCGCGGG 317 

(2) INFORMATION FOR SEQ ID NO: 29: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

GATCGTGGAG CTGTCGATGA ACAGCGTTGC CGGACGCGCG GCGGCCAGCA CGTCGGTGTA
GCAGCGCCGG ACCACCTCGC CGGTGGGCAG CATGGTGATG ACCACGTCGG CCTCGGCCAC
CGCTTCGGGC GCGCTACGAA ACACCGCGAC ACCGTGCGCG GCGGCGCCGG ACGCCGCCGT 
GG 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

GATCGCGAAG TTTGGTGAGC AGGTGGTCGA CGCGAAAGTC TGGGCGCCTG CGAAGCGGGT 
CGGCGTTCAC GAGGCGAAGA CACGCCTGTC CGAGCTGCTG CGGCTCGTCT ACGGCGGGCA 
GAGGTTGAGA TTGCCCGCCG CGGCGAGCCG GTAGCAAAGC TTGTGCCGCT GCATCCTCAT 
GAGACTCGGC GGTTAGGCAT TGACCATGGC GTGTACCGCG TGCCCGACGA TTTGGACGCT 
CCGTTGTCAG ACGACGTGCT CGAACGCTTT CACCGGTGAA GCGCTACCTC ATCGACACCC 
ACGTTTGG 

(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCGACGACGA GCAACTCACG TGGATGATGG TCGGCAGCGG CATTGAGGAC GGAGAGAATC 
CGGCCGAAGC TGCCGCGCGG CAAGTGCTCA TAGTGACCGG CCGTAGAGGG CTCCCCCGAT 
GGCACCGGAC TATTCTGGTG TGCCGCTGGC CGGTAAGAGC GGGTAAAAGA ATGTGAGGGG 
ACACGATGAG CAATCACACC TACCGAGTGA TCGAGATCGT CGGGACCTCG CCCGACGGCG 
TCGACGCGGC AATCCAGGGC GGTCTGG 

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

CTCGTGCCGA AAGAATGTGA GGGGACACGA TGAGCAATCA CACCTACCGA GTGATCGAGA 
TCGTCGGGAC CTCGCCCGAC GGCGTCGACG CGGCAATCCA GGGCGGTCTG GCCCGAGCTG 
CGCAGACCAT GCGCGCGCTG GACTGGTTCG AAGTACAGTC AATTCGAGGC CACCTGGTCG
ACGGAGCGGT CGCGCACTTC CAGGTGACTA TGAAAGTCGG CTTCCGCTGG AGGATTCCTG 
AACCTTCAAG CGCGGCCGAT AACTGAGGTG CATCATTAAG CGACTTTTCC AGAACATCCT 
GACGCGCTCG AAACGCGGTT CAGCCGACGG TGGCTCCGCC GAGGCGCTGC CTCCAAAATC 
CCTGCGACAA TTCGTCGGCG GCGCCTACAA GGAAGTCGGT GCTGAATTCG TCGGGTATCT
GGTCGACCTG TGTGGGCTGC AGCCGGACGA AGCGGTGCTC GACGTCGGCT GCGGCTCGGG 480

GCGGATGGCG TTGCCGCTCA CCGGCTATCT GAACAGCGAG GGACGCTACG CCGGCTTCGA 540

TATCTCGCAG AAAGCCATCG CGTGGTGCCA GGAGCACATC ACCTCGGCGC ACCCCAACTT 600 

CCAGTTCGAG GTCTCCGACA TCTACAACTC GCTGTACAAC CCGAAAGGGA AATACCAGTC 660 

ACTAGACTTT CGCTTTCCAT ATCCGGATGC GTCGTTCGAT GTGGTGTTTC TTACCTCGGT 720

GTTCACCCAC ATGTTTCCGC CGGACGTGGA GCACTATCTG GACGAGATCT CCCGCGTGCT 780 

GAAGCCCGGC GGACGATGCC TGTGCACGTA CTTCTTGCTC AATGACGAGT CGTTAGCCCA 840

CATCGCGGAA GGAAAGAGTG CGCACAACTT CCAGCATGAG GGACCGGGTT ATCGGACAAT 900 

CCACAAGAAG CGGCCCGAAG AAGCAATCGG CTTGCCGGAG ACCTTCGTCA GGGATGTCTA 960 

TGGCAAGTTC GGCCTCGCCG TGCACGAACC ATTGCACTAC GGCTCATGGA GTGGCCGGGA 1020

ACCACGCCTA AGCTTCCAGG ACATCGTCAT CGCGACCAAA ACCGCGAGCT AGGTCGGCAT 1080 

CCGGGAAGCA TCGCGACACC GTGGCGCCGA GCGCCGCTGC CGGCAGGCCG ATTAGGCGGG 1140 

CAGATTAGCC CGCCGCGGCT CCCGGCTCCG AGTACGGCGC CCCGAATGGC GTCACCGGCT 1200 

GGTAACCACG CTTGCGCGCC TGGGCGGCGG CCTGCCGGAT CAGGTGGTAG ATGCCGACAA 1260 

AGCCTGCGTG ATCGGTCATC ACCAACGGTG ACAGCAGCCG GTTGTGCACC AGCGCGAACG 1320 

CCACCCGGGT CTGGGGGTGT GTGGAGCCGA TCGAGCCGCC CAAGCCCACA TGACCAAACC 1380 

CCGGCATCAC GTTGCCGATC GGCATACCGT GATAGCCAAG ATGAAAATTT AAGGGCACCA 1440 

ATAGATTTCG ATCCGGCAGA ACTTGCCGTC GGTTGCGGGT CAGGCCCGTG ACCAGCTCCC 1500 

GCGACAAGAA CCGTATGCCG TCGATCTCGC CTCGTGCCG 1539

(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 851 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CTGCAGGGTG GCGTGGATGA GCGTCACCGC GGGGCAGGCC GAGCTGACCG CCGCCCAGGT 60

CCGGGTTGCT GCGGCGGCCT ACGAGACGGC GTATGGGCTG ACGGTGCCCC CGCCGGTGAT 120

CGCCGAGAAC CGTGCTGAAC TGATGATTCT GATAGCGACC AACCTCTTGG GGCAAAACAC 180 

CCCGGCGATC GCGGTCAACG AGGCCGAATA CGGCGAGATG TGGGCCCAAG ACGCCGCCGC 240 

GATGTTTGGC TACGCCGCGG CGACGGCGAC GGCGACGGCG ACGTTGCTGC CGTTCGAGGA 300 

GGCGCCGGAG ATGACCAGCG CGGGTGGGCT CCTCGAGCAG GCCGCCGCGG TCGAGGAGGC 360 

CTCCGACACC GCCGCGGCGA ACCAGTTGAT GAACAATGTG CCCCAGGCGC TGAAACAGTT 420

GGCCCAGCCC ACGCAGGGCA CCACGCCTTC TTCCAAGCTG GGTGGCCTGT GGAAGACGGT 480 

CTCGCCGCAT CGGTCGCCGA TCAGCAACAT GGTGTCGATG GCCAACAACC ACATGTCGAT 540 

GACCAACTCG GGTGTGTCGA TGACCAACAC CTTGAGCTCG ATGTTGAAGG GCTTTGCTCC 600 

GGCGGCGGCC GCCCAGGCCG TGCAAACCGC GGCGCAAAAC GGGGTCCGGG CGATGAGCTC 660 

GCTGGGCAGC TCGCTGGGTT CTTCGGGTCT GGGCGGTGGG GTGGCCGCCA ACTTGGGTCG 720 

GGCGGCCTCG GTACGGTATG GTCACCGGGA TGGCGGAAAA TATGCANAGT CTGGTCGGCG 780

GAACGGTGGT CCGGCGTAAG GTTTACCCCC GTTTTCTGGA TGCGGTGAAC TTCGTCAACG 840

GAAACAGTTA C 851 

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

GATCGATCGG GCGGAAATTT GGACCAGATT CGCCTCCGGC GATAACCCAA TCAATCGAAC 60

CTAGATTTAT TCCGTCCAGG GGCCCGAGTA ATGGCTCGCA GGAGAGGAAC CTTACTGCTG 120 

CGGGCACCTG TCGTAGGTCC TCGATACGGC GGAAGGCGTC GACATTTTCC ACCGACACCC 180 

CCATCCAAAC GTTCGAGGGC CACTCCAGCT TGTGAGCGAG GCGACGCAGT CGCAGGCTGC 240

GCTTGGTCAA GATC 254



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

GATCCTGACC GAAGCGGCCG CCGCCAAGGC GAAGTCGCTG TTGGACCAGG AGGGACGGGA 60

CGATCTGGCG CTGCGGATCG CGGTTCAGCC GGGGGGGTGC GCTGGATTGC GCTATAACCT 120 

TTTCTTCGAC GACCGGACGC TGGATGGTGA CCAAACCGCG GAGTTCGGTG GTGTCAGGTT 180

GATCGTGGAC CGGATGAGCG CGCCGTATGT GGAAGGCGCG TCGATCGATT TCGTCGACAC 240

TATTGAGAAG CAAGGTTCAC CATCGACAAT CCCAACGCCA CCGGCTCCTG CGCGTGCGGG 300

GATTCGTTCA ACTGATAAAA CGCTAGTACG ACCCCGCGGT GCGCAACACG TACGAGCACA 360

CCAAGACCTG ACCGCGCTGG AAAAGCAACT GAGCGATGCC TTGCACCTGA CCGCGTGGCG 420

GGCCGCCGGC GGCAGGTGTC ACCTGCATGG TGAACAGCAC CTGGGCCTGA TATTGCGACC 480

AGTACACGAT TTTGTCGATC GAGGTCACTT CGACCTGGGA GAACTGCTTG CGGAACGCGT 540

CGCTGCTCAG CTTGGCCAAG GCCTGATCGG AGCGCTTGTC GCGCACGCCG TCGTGGATAC 600

CGCACAGCGC ATTGCGAACG ATGGTGTCCA CATCGCGGTT CTCCAGCGCG TTGAGGTATC 660

CCTGAATCGC GGTTTTGGCC GGTCCCTCCG AGAATGTGCC TGCCGTGTTG GCTCCGTTGG 720

TGCGGACCCC GTATATGATC GCCGCCGTCA TAGCCGACAC CAGCGCGAGG GCTACCACAA 780

TGCCGATCAG CAGCCGCTTG TGCCGTCGCT TCGGGTAGGA CACCTGCGGC GGCACGCCGG 840

GATATGCGGC GGGCGGCAGC GCCGCGTCGT CTGCCGGTCC CGGGGCGAAG GCCGGTTCGG 900

CGGCGCCGAG GTCGTGGGGG TAGTCCAGGG CTTGGGGTTC GTGGGATGAG GGCTCGGGGT 960

ACGGCGCCGG TCCGTTGGTG CCGACACCGG GGTTCGGCGA GTGGGGACCG GGCATTGTGG 1020 

TTCTCCTAGG GTGGTGGACG GGACCAGCTG CTAGGGCGAC AACCGCCCGT CGCGTCAGCC 1080 

GGCAGCATCG GCAATCAGGT GAGCTCCCTA GGCAGGCTAG CGCAACAGCT GCCGTCAGCT 1140 

CTCAACGCGA CGGGGCGGGC CGCGGCGCCG ATAATGTTGA AAGACTAGGC AACCTTAGGA 1200 

ACGAAGGACG GAGATTTTGT GACGATC 1227 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGGGCCGGC GGGGCCGGCG 60 

GGACCGGCGC TAACGGTGGT GCCGGCGGCA ACGCCTGGTT GTTCGGGGCC GGCGGGTCCG 120 

GCGGNGCCGG CACCAATGGT GGNGTCGGCG GGTCCGGCGG ATTTGTCTAC GGCAACGGCG 180 

G 181 



(2) INFORMATION FOR SEQ ID NO: 37:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 290 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



GCGGTGTCGG CGGATCCGGC GGGTGGTTGA ACGGCAACGG CGGTGTCGGC GGCCGGGGCG 60 

GCGACGGCGT CTTTGCCGGT GCCGGCGGCC AGGGCGGCCT CGGTGGGCAG GGCGGCAATG 120 

GCGGCGGCTC CACCGGCGGC AACGGCGGTC TTGGCGGCGC GGGCGGTGGC GGAGGCAACG 180 

CCCCGGACGG CGGCTTCGGT GGCAACGGCG GTAAGGGTGG CCAGGGCGGN ATTGGCGGCG 240

GCACTCAGAG CGCGACCGGC CTCGGNGGTG ACGGCGGTGA CGGCGGTGAC 290 



(2) INFORMATION FOR SEQ ID NO: 38: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:

GATCCAGTGG CATGGNGGGT GTCAGTGGAA GCAT 34

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

GATCGCTGCT CGTCCCCCCC TTGCCGCCGA CGCCACCGGT CCCACCGTTA CCGAACAAGC 60 
TGGCGTGGTC GCCAGCACCC CCGGCACCGC CGACGCCGGA GTCGAACAAT GGCACCGTCG 120 
TATCCCCACC ATTGCCGCCG GNCCCACCGG CACCG 155 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
ATGGCGTTCA CGGGGCGCCG GGGACCGGGC AGCCCGGNGG GGCCGGGGGG TGG 53 








(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 132 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

GATCCACCGC GGGTGCAGAC GGTGCCCGCG GCGCCACCCC GACCAGCGGC GGCAACGGCG 
GCACCGGCGG CAACGGCGCG AACGCCACCG TCGTCGGNGG GGCCGGCGGG GCCGGCGGCA 
AGGGCGGCAA CG 

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

GATCGGCGGC CGGNACGGNC GGGGACGGCG GCAAGGGCGG NAACGGGGGC GCCGNAGCCA 
CCNGCCAAGA ATCCTCCGNG TCCNCCAATG GCGCGAATGG CGGACAGGGC GGCAACGGCG 
GCANCGGCGG CA 

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 
CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGCCGGC 
ATGAACGGGC GGCATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGCGACGA TAATGGCTAT 
AGCACTAAGG AGGATGATCC GATATGACGC AGTCGCAGAC CGTGACGGTG GATCAGCAAG
AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 
CCATCACACC GTGCGAACTC ACGGNGGNTA AAAACGCCGC CCAACAGNTG GTNTTGTCCG 
CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT
CGCTGCGCAA CGCGGCCAAG GNGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 
ACAACGACGG CGAAGGAACT GTGCAGGCAG AATCGGCCGG GGCCGTCGGA GGGGACAGTT
CGGCCGAACT AACCGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 
TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTGNG 
GGGATGGGTG GAACACTTNC ACCCTGACGC TGCAAGGCGA CG 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAAGCCGCAG CGCTGTCGGG CGACGTGGCG GTCAAAGCGG CATCGCTCGG TGGCGGTGGA 60 

GGCGGCGGGG TGCCGTCGGC GCCGTTGGGA TCCGCGATCG GGGGCGCCGA ATCGGTGCGG 120 

CCCGCTGGCG CTGGTGACAT TGCCGGCTTA GGCCAGGGAA GGGCCGGCGG CGGCGCCGCG 180

CTGGGCGGCG GTGGCATGGG AATGCCGATG GGTGCCGCGC ATCAGGGACA AGGGGGCGCC 240

AAGTCCAAGG GTTCTCAGCA GGAAGACGAG GCGCTCTACA CCGAGGATCC TCGTGCCG 298 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:



CGGCACGAGG ATCGAATCGC GTCGCCGGGA GCACAGCGTC GCACTGCACC AGTGGAGGAG 60

CCATGACCTA CTCGCCGGGT AACCCCGGAT ACCCGCAAGC GCAGCCCGCA GGCTCCTACG 120

GAGGCGTCAC ACCCTCGTTC GCCCACGCCG ATGAGGGTGC GAGCAAGCTA CCGATGTACC 180

TGAACATCGC GGTGGCAGTG CTCGGTCTGG CTGCGTACTT CGCCAGCTTC GGCCCAATGT 240

TCACCCTCAG TACCGAACTC GGGGGGGGTG ATGGCGCAGT GTCCGGTGAC ACTGGGCTGC 300

CGGTCGGGGT GGCTCTGCTG GCTGCGCTGC TTGCCGGGGT GGTTCTGGTG CCTAAGGCCA 360

AGAGCCATGT GACGGTAGTT GCGGTGCTCG GGGTACTCGG CGTATTTCTG ATGGTCTCGG 420 

CGACGTTTAA CAAGCCCAGC GCCTATTCGA CCGGTTGGGC ATTGTGGGTT GTGTTGGCTT 480 

TCATCGTGTT CCAGGCGGTT GCGGCAGTCC TGGCGCTCTT GGTGGAGACC GGCGCTATCA 540 

CCGCGCCGGC GCCGCGGCCC AAGTTCGACC CGTATGGACA GTACGGGCGG TACGGGCAGT 600 

ACGGGCAGTA CGGGGTGCAG CCGGGTGGGT ACTACGGTCA GCAGGGTGCT CAGCAGGCCG 660 

CGGGACTGCA GTCGCCCGGC CCGCAGCAGT CTCCGCAGCC TCCCGGATAT GGGTCGCAGT 720 

ACGGCGGCTA TTCGTCCAGT CCGAGCCAAT CGGGCAGTGG ATACACTGCT CAGCCCCCGG 780 

CCCAGCCGCC GGCGCAGTCC GGGTCGCAAC AATCGCACCA GGGCCCATCC ACGCCACCTA 840 

CCGGCTTTCC GAGCTTCAGC CCACCACCAC CGGTCAGTGC CGGGACGGGG TCGCAGGCTG 900 

GTTCGGCTCC AGTCAACTAT TCAAACCCCA GCGGGGGCGA GCAGTCGTCG TCCCCCGGGG 960 

GGGCGCCGGT CTAACCGGGC GTTCCCGCGT CCGGTCGCGC GTGTGCGCGA AGAGTGAACA 1020 

GGGTGTCAGC AAGCGCGGAC GATCCTCGTG CCGAATTC 1058 






(2) INFORMATION FOR SEQ ID NO: 46: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CGGCACGAGA GACCGATGCC GCTACCCTCG CGCAGGAGGC AGGTAATTTC GAGCGGATCT 60 

CCGGCGACCT GAAAACCCAG ATCGACCAGG TGGAGTCGAC GGCAGGTTCG TTGCAGGGCC 120 

AGTGGCGCGG CGCGGCGGGG ACGGCCGCCC AGGCCGCGGT GGTGCGCTTC CAAGAAGCAG 180 

CCAATAAGCA GAAGCAGGAA CTCGACGAGA TCTCGACGAA TATTCGTCAG GCCGGCGTCC 240 

AATACTCGAG GGCCGACGAG GAGCAGCAGC AGGCGCTGTC CTCGCAAATG GGCTTCTGAC 300 

CCGCTAATAC GAAAAGAAAC GGAGCAA 327

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

CGGTCGCGAT GATGGCGTTG TCGAACGTGA CCGATTCTGT ACCGCCGTCG TTGAGATCAA 60 
CCAACAACGT GTTGGCGTCG GCAAATGTGC CGNACCCGTG GATCTCGGTG ATCTTGTTCT 120 
TCTTCATCAG GAAGTGCACA CCGGCCACCC TGCCCTCGGN TACCTTTCGG 170 

(2) INFORMATION FOR SEQ ID NO: 48:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 127 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:

GATCCGGCGG CACGGGGGGT GCCGGCGGCA GCACCGCTGG CGCTGGCGGC AACGGCGGGG 60

CCGGGGGTGG CGGCGGAACC GGTGGGTTGC TCTTCGGCAA CGGCGGTGCC GGCGGGCACG 120

GGGCCGT 127

(2) INFORMATION FOR SEQ ID NO: 49:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 81 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

CGGCGGCAAG GGCGGCACCG CCGGCAACGG GAGCGGCGCG GCCGGCGGCA ACGGCGGCAA 60 
CGGCGGCTCC GGCCTCAACG G 81 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 149 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



GATCAGGGCT GGCCGGCTCC GGCCAGAAGG GCGGTAACGG AGGAGCTGCC GGATTGTTTG 60
GCAACGGCGG GGCCGGNGGT GCCGGCGCGT CCAACCAAGC CGGTAACGGC GGNGCCGGCG 120
GAAACGGTGG TGCCGGTGGG CTGATCTGG 149

(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CGGCACGAGA TCACACCTAC CGAGTGATCG AGATCGTCGG GACCTCGCCC GACGGTGTCG 
ACGCGGNAAT CCAGGGCGGT CTGGCCCGAG CTGCGCAGAC CATGCGCGCG CTGGACTGGT 
TCGAAGTACA GTCAATTCGA GGCCACCTGG TCGACGGAGC GGTCGCGCAC TTCCAGGTGA 
CTATGAAAGT CGGCTTCCGC CTGGAGGATT CCTGAACCTT CAAGCGCGGC CGATAACTGA 
GGTGCATCAT TAAGCGACTT TTCCAGAACA TCCTGACGCG GTSGAAACSC GGTTCAGCCG
ACGGTGGCTC CGCCGAGGCG CTGCCTCCAA AATCCCTGCG ACAATTCGTC GGCGG 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

ATGCATCACC ATCACCATCA CATGCATCAG GTGGACCCCA ACTTGACACG TCGCAAGGGA 
CGATTGGCGG CACTGGCTAT CGCGGCGATG GCCAGCGCCA GCCTGGTGAC CGTTGCGGTG
CCCGCGACCG CCAACGCCGA TCCGGAGCCA GCGCCCCCGG TACCCACAAC GGCCGCCTCG 
CCGCCGTCGA CCGCTGCAGC GCCACCCGCA CCGGCGACAC CTGTTGCCCC CCCACCACCG 
GCCGCCGCCA ACACGCCGAA TGCCCAGCCG GGCGATCCCA ACGCAGCACC TCCGCCGGCC 
GACCCGAACG CACCGCCGCC ACCTGTCATT GCCCCAAACG CACCCCAACC TGTCCGGATC 
GACAACCCGG TTGGAGGATT CAGCTTCGCG CTGCCTGCTG GCTGGGTGGA GTCTGACGCC 
GCCCACTTCG ACTACGGTTC AGCACTCCTC AGCAAAACCA CCGGGGACCC GCCATTTCCC 
GGACAGCCGC CGCCGGTGGC CAATGACACC CGTATCGTGC TCGGCCGGCT AGACCAAAAG 
CTTTACGCCA GCGCCGAAGC CACCGACTCC AAGGCCGCGG CCCGGTTGGG CTCGGACATG
GGTGAGTTCT ATATGCCCTA CCCGGGCACC CGGATCAACC AGGAAACCGT CTCGCTCGAC 
GCCAACGGGG TGTCTGGAAG CGCGTCGTAT TACGAAGTCA AGTTCAGCGA TCCGAGTAAG 
CCGAACGGCC AGATCTGGAC GGGCGTAATC GGCTCGCCCG CGGCGAACGC ACCGGACGCC 
GGGCCCCCTC AGCGCTGGTT TGTGGTATGG CTCGGGACCG CCAACAACCC GGTGGACAAG 
GGCGCGGCCA AGGCGCTGGC CGAATCGATC CGGCCTTTGG TCGCCCCGCC GCCGGCGCCG 
GCACCGGCTC CTGCAGAGCC CGCTCCGGCG CCGGCGCCGG CCGGGGAAGT CGCTCCTACC 
CCGACGACAC CGACACCGCA GCGGACCTTA CCGGCCTGA 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53:



Met His His His His His His Met His Gln Val Asp Pro Asn Leu Thr
1 5 10 15

Arg Arg Lys Gly Arg Leu Ala Ala Leu Ala Ile Ala Ala Met Ala Ser
20 25 30

Ala Ser Leu Val Thr Val Ala Val Pro Ala Thr Ala Asn Ala Asp Pro
35 40 45

Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro Ser Thr
50 55 60

Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro Val Ala Pro Pro Pro Pro
65 70 75 80

Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro Gly Asp Pro Asn Ala Ala
85 90 95

Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro Pro Pro Val Ile Ala Pro
100 105 110

Asn Ala Pro Gln Pro Val Arg Ile Asp Asn Pro Val Gly Gly Phe Ser
115 120 125

Phe Ala Leu Pro Ala Gly Trp Val Glu Ser Asp Ala Ala His Phe Asp
130 135 140

Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr Gly Asp Pro Pro Phe Pro
145 150 155 160

Gly Gln Pro Pro Pro Val Ala Asn Asp Thr Arg Ile Val Leu Gly Arg
165 170 175

Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu Ala Thr Asp Ser Lys Ala
180 185 190

Ala Ala Arg Leu Gly Ser Asp Met Gly Glu Phe Tyr Met Pro Tyr Pro
195 200 205

Gly Thr Arg Ile Asn Gln Glu Thr Val Ser Leu Asp Ala Asn Gly Val
210 215 220

Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys Phe Ser Asp Pro Ser Lys
225 230 235 240

Pro Asn Gly Gln Ile Trp Thr Gly Val Ile Gly Ser Pro Ala Ala Asn
245 250 255

Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp Phe Val Val Trp Leu Gly
260 265 270

Thr Ala Asn Asn Pro Val Asp Lys Gly Ala Ala Lys Ala Leu Ala Glu
275 280 285

Ser Ile Arg Pro Leu Val Ala Pro Pro Pro Ala Pro Ala Pro Ala Pro
290 295 300

Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala Gly Glu Val Ala Pro Thr
305 310 315 320

Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu Pro Ala
325 330

(2) INFORMATION FOR SEQ ID NO: 54:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

Asp Pro Val Asp Ala Val Ile Asn Thr Thr Xaa Asn Tyr Gly Gln Val
1 5 10 15

Val Ala Ala Leu
20

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ala Val Glu Ser Gly Met Leu Ala Leu Gly Thr Pro Ala Pro Ser 

1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56:

Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala Ala Lys
1 5 10 15

Glu Gly Arg



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Tyr Tyr Trp Cys Pro Gly Gln Pro Phe Asp Pro Ala Trp Gly Pro
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

Asp Ile Gly Ser Glu Ser Thr Glu Asp Gln Gln Xaa Ala Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Ala Glu Glu Ser Ile Ser Thr Xaa Glu Xaa Ile Val Pro
1 5 10

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Asp Pro Glu Pro Ala Pro Pro Val Pro Thr Ala Ala Ala Ala Pro Pro 
1 5 10 15

Ala 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Ala Pro Lys Thr Tyr Xaa Glu Glu Leu Lys Gly Thr Asp Thr Gly 
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 



Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln Gln Thr Ser
1 5 10 15

Leu Leu Asn Asn Leu Ala Asp Pro Asp Val Ser Phe Ala Asp
20 25 30

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 187 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



Thr Gly Ser Leu Asn Gln Thr His Asn Arg Arg Ala Asn Glu Arg Lys
1 5 10 15

Asn Thr Thr Met Lys Met Val Lys Ser Ile Ala Ala Gly Leu Thr Ala
20 25 30

Ala Ala Ala Ile Gly Ala Ala Ala Ala Gly Val Thr Ser Ile Met Ala
35 40 45

Gly Gly Pro Val Val Tyr Gln Met Gln Pro Val Val Phe Gly Ala Pro
50 55 60

Leu Pro Leu Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln
65 70 75 80

Leu Thr Ser Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala
85 90 95

Asn Lys Gly Ser Leu Val Glu Gly Gly Ile Gly Gly Thr Glu Ala Arg
100 105 110

Ile Ala Asp His Lys Leu Lys Lys Ala Ala Glu His Gly Asp Leu Pro
115 120 125

Leu Ser Phe Ser Val Thr Asn Ile Gln Pro Ala Ala Ala Gly Ser Ala
130 135 140

Thr Ala Asp Val Ser Val Ser Gly Pro Lys Leu Ser Ser Pro Val Thr
145 150 155 160

Gln Asn Val Thr Phe Val Asn Gln Gly Gly Trp Met Leu Ser Arg Ala
165 170 175

Ser Ala Met Glu Leu Leu Gln Ala Ala Gly Xaa
180 185

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Asp Glu Val Thr Val Glu Thr Thr Ser Val Phe Arg Ala Asp Phe Leu
1 5 10 15

Ser Glu Leu Asp Ala Pro Ala Gln Ala Gly Thr Glu Ser Ala Val Ser
20 25 30

Gly Val Glu Gly Leu Pro Pro Gly Ser Ala Leu Leu Val Val Lys Arg
35 40 45

Gly Pro Asn Ala Gly Ser Arg Phe Leu Leu Asp Gln Ala Ile Thr Ser
50 55 60

Ala Gly Arg His Pro Asp Ser Asp Ile Phe Leu Asp Asp Val Thr Val
65 70 75 80

Ser Arg Arg His Ala Glu Phe Arg Leu Glu Asn Asn Glu Phe Asn Val
85 90 95

Val Asp Val Gly Ser Leu Asn Gly Thr Tyr Val Asn Arg Glu Pro Val
100 105 110

Asp Ser Ala Val Leu Ala Asn Gly Asp Glu Val Gln Ile Gly Lys Leu
115 120 125

Arg Leu Val Phe Leu Thr Gly Pro Lys Gln Gly Glu Asp Asp Gly Ser
130 135 140

Thr Gly Gly Pro
145

(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 230 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:

Thr Ser Asn Arg Pro Ala Arg Arg Gly Arg Arg Ala Pro Arg Asp Thr
1 5 10 15

Gly Pro Asp Arg Ser Ala Ser Leu Ser Leu Val Arg His Arg Arg Gln
20 25 30

Gln Arg Asp Ala Leu Cys Leu Ser Ser Thr Gln Ile Ser Arg Gln Ser
35 40 45

Asn Leu Pro Pro Ala Ala Gly Gly Ala Ala Asn Tyr Ser Arg Arg Asn
50 55 60

Phe Asp Val Arg Ile Lys Ile Phe Met Leu Val Thr Ala Val Val Leu
65 70 75 80

Leu Cys Cys Ser Gly Val Ala Thr Ala Ala Pro Lys Thr Tyr Cys Glu
85 90 95

Glu Leu Lys Gly Thr Asp Thr Gly Gln Ala Cys Gln Ile Gln Met Ser
100 105 110

Asp Pro Ala Tyr Asn Ile Asn Ile Ser Leu Pro Ser Tyr Tyr Pro Asp
115 120 125

Gln Lys Ser Leu Glu Asn Tyr Ile Ala Gln Thr Arg Asp Lys Phe Leu
130 135 140

Ser Ala Ala Thr Ser Ser Thr Pro Arg Glu Ala Pro Tyr Glu Leu Asn
145 150 155 160

Ile Thr Ser Ala Thr Tyr Gln Ser Ala Ile Pro Pro Arg Gly Thr Gln
165 170 175

Ala Val Val Leu Xaa Val Tyr His Asn Ala Gly Gly Thr His Pro Thr
180 185 190

Thr Thr Tyr Lys Ala Phe Asp Trp Asp Gln Ala Tyr Arg Lys Pro Ile
195 200 205

Thr Tyr Asp Thr Leu Trp Gln Ala Asp Thr Asp Pro Leu Pro Val Val
210 215 220

Phe Pro Ile Val Ala Arg
225 230

(2) INFORMATION FOR SEQ ID NO: 66:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 132 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

Thr Ala Ala Ser Asp Asn Phe Gln Leu Ser Gln Gly Gly Gln Gly Phe
1 5 10 15

Ala Ile Pro Ile Gly Gln Ala Met Ala Ile Ala Gly Gln Ile Arg Ser
20 25 30

Gly Gly Gly Ser Pro Thr Val His Ile Gly Pro Thr Ala Phe Leu Gly
35 40 45

Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gln Arg Val
50 55 60

Val Gly Ser Ala Pro Ala Ala Ser Leu Gly Ile Ser Thr Gly Asp Val
65 70 75 80

Ile Thr Ala Val Asp Gly Ala Pro Ile Asn Ser Ala Thr Ala Met Ala
85 90 95

Asp Ala Leu Asn Gly His His Pro Gly Asp Val Ile Ser Val Asn Trp
100 105 110

Gln Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu
115 120 125

Gly Pro Pro Ala
130

(2) INFORMATION FOR SEQ ID NO: 67:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 100 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:

Val Pro Leu Arg Ser Pro Ser Met Ser Pro Ser Lys Cys Leu Ala Ala
1 5 10 15

Ala Gln Arg Asn Pro Val Ile Arg Arg Arg Arg Leu Ser Asn Pro Pro
20 25 30

Pro Arg Lys Tyr Arg Ser Met Pro Ser Pro Ala Thr Ala Ser Ala Gly
35 40 45

Met Ala Arg Val Arg Arg Arg Ala Ile Trp Arg Gly Pro Ala Thr Xaa
50 55 60

Ser Ala Gly Met Ala Arg Val Arg Arg Trp Xaa Val Met Pro Xaa Val
65 70 75 80

Ile Gln Ser Thr Xaa Ile Arg Xaa Xaa Gly Pro Phe Asp Asn Arg Gly
85 90 95

Ser Glu Arg Lys
100



(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

Met Thr Asp Asp Ile Leu Leu Ile Asp Thr Asp Glu Arg Val Arg Thr
1 5 10 15

Leu Thr Leu Asn Arg Pro Gln Ser Arg Asn Ala Leu Ser Ala Ala Leu
20 25 30

Arg Asp Arg Phe Phe Ala Xaa Leu Xaa Asp Ala Glu Xaa Asp Asp Asp
35 40 45

Ile Asp Val Val Ile Leu Thr Gly Ala Asp Pro Val Phe Cys Ala Gly
50 55 60

Leu Asp Leu Lys Val Ala Gly Arg Ala Asp Arg Ala Ala Gly His Leu
65 70 75 80

Thr Ala Val Gly Gly His Asp Gln Ala Gly Asp Arg Arg Asp Gln Arg
85 90 95

Arg Arg Gly His Arg Arg Ala Arg Thr Gly Ala Val Leu Arg His Pro
100 105 110

Asp Arg Leu Arg Ala Arg Pro Leu Arg Arg His Pro Arg Pro Gly Gly
115 120 125

Ala Ala Ala His Leu Gly Thr Gln Cys Val Leu Ala Ala Lys Gly Arg
130 135 140

His Arg Xaa Gly Pro Val Asp Glu Pro Asp Arg Arg Leu Pro Val Arg
145 150 155 160

Asp Arg Arg























(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 



Met Lys Phe Val Asn His Ile Glu Pro Val Ala Pro Arg Arg Ala Gly
1 5 10 15

Gly Ala Val Ala Glu Val Tyr Ala Glu Ala Arg Arg Glu Phe Gly Arg
20 25 30

Leu Pro Glu Pro Leu Ala Met Leu Ser Pro Asp Glu Gly Leu Leu Thr
35 40 45

Ala Gly Trp Ala Thr Leu Arg Glu Thr Leu Leu Val Gly Gln Val Pro
50 55 60

Arg Gly Arg Lys Glu Ala Val Ala Ala Ala Val Ala Ala Ser Leu Arg
65 70 75 80

Cys Pro Trp Cys Val Asp Ala His Thr Thr Met Leu Tyr Ala Ala Gly
85 90 95

Gln Thr Asp Thr Ala Ala Ala Ile Leu Ala Gly Thr Ala Pro Ala Ala
100 105 110

Gly Asp Pro Asn Ala Pro Tyr Val Ala Trp Ala Ala Gly Thr Gly Thr
115 120 125

Pro Ala Gly Pro Pro Ala Pro Phe Gly Pro Asp Val Ala Ala Glu Tyr
130 135 140

Leu Gly Thr Ala Val Gln Phe His Phe Ile Ala Arg Leu Val Leu Val
145 150 155 160

Leu Leu Asp Glu Thr Phe Leu Pro Gly Gly Pro Arg Ala Gln Gln Leu
165 170 175

Met Arg Arg Ala Gly Gly Leu Val Phe Ala Arg Lys Val Arg Ala Glu
180 185 190

His Arg Pro Gly Arg Ser Thr Arg Arg Leu Glu Pro Arg Thr Leu Pro
195 200 205

Asp Asp Leu Ala Trp Ala Thr Pro Ser Glu Pro Ile Ala Thr Ala Phe
210 215 220

Ala Ala Leu Ser His His Leu Asp Thr Ala Pro His Leu Pro Pro Pro
225 230 235 240

Thr Arg Gln Val Val Arg Arg Val Val Gly Ser Trp His Gly Glu Pro
245 250 255

Met Pro Met Ser Ser Arg Trp Thr Asn Glu His Thr Ala Glu Leu Pro
260 265 270

Ala Asp Leu His Ala Pro Thr Arg Leu Ala Leu Leu Thr Gly Leu Ala
275 280 285

Pro His Gln Val Thr Asp Asp Asp Val Ala Ala Ala Arg Ser Leu Leu
290 295 300

Asp Thr Asp Ala Ala Leu Val Gly Ala Leu Ala Trp Ala Ala Phe Thr
305 310 315 320

Ala Ala Arg Arg Ile Gly Thr Trp Ile Gly Ala Ala Ala Glu Gly Gln
325 330 335

Val Ser Arg Gln Asn Pro Thr Gly
340



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

Asp Asp Pro Asp Met Pro Gly Thr Val Ala Lys Ala Val Ala Asp Ala
1 5 10 15

Leu Gly Arg Gly Ile Ala Pro Val Glu Asp Ile Gln Asp Cys Val Glu
20 25 30

Ala Arg Leu Gly Glu Ala Gly Leu Asp Asp Val Ala Arg Val Tyr Ile
35 40 45

Ile Tyr Arg Gln Arg Arg Ala Glu Leu Arg Thr Ala Lys Ala Leu Leu
50 55 60

Gly Val Arg Asp Glu Leu Lys Leu Ser Leu Ala Ala Val Thr Val Leu
65 70 75 80

Arg Glu Arg Tyr Leu Leu His Asp Glu Gln Gly Arg Pro Ala Glu Ser
85 90 95

Thr Gly Glu Leu Met Asp Arg Ser Ala Arg Cys Val Ala Ala Ala Glu
100 105 110

Asp Gln Tyr Glu Pro Gly Ser Ser Arg Arg Trp Ala Glu Arg Phe Ala
115 120 125

Thr Leu Leu Arg Asn Leu Glu Phe Leu Pro Asn Ser Pro Thr Leu Met
130 135 140

Asn Ser Gly Thr Asp Leu Gly Leu Leu Ala Gly Cys Phe Val Leu Pro
145 150 155 160

Ile Glu Asp Ser Leu Gln Ser Ile Phe Ala Thr Leu Gly Gln Ala Ala
165 170 175

Glu Leu Gln Arg Ala Gly Gly Gly Thr Gly Tyr Ala Phe Ser His Leu
180 185 190

Arg Pro Ala Gly Asp Arg Val Ala Ser Thr Gly Gly Thr Ala Ser Gly
195 200 205

Pro Val Ser Phe Leu Arg Leu Tyr Asp Ser Ala Ala Gly Val Val Ser
210 215 220

Met Gly Gly Arg Arg Arg Gly Ala Cys Met Ala Val Leu Asp Val Ser
225 230 235 240

His Pro Asp Ile Cys Asp Phe Val Thr Ala Lys Ala Glu Ser Pro Ser
245 250 255

Glu Leu Pro His Phe Asn Leu Ser Val Gly Val Thr Asp Ala Phe Leu
260 265 270

Arg Ala Val Glu Arg Asn Gly Leu His Arg Leu Val Asn Pro Arg Thr
275 280 285

Gly Lys Ile Val Ala Arg Met Pro Ala Ala Glu Leu Phe Asp Ala Ile
290 295 300

Cys Lys Ala Ala His Ala Gly Gly Asp Pro Gly Leu Val Phe Leu Asp
305 310 315 320

Thr Ile Asn Arg Ala Asn Pro Val Pro Gly Arg Gly Arg Ile Glu Ala
325 330 335

Thr Asn Pro Cys Gly Glu Val Pro Leu Leu Pro Tyr Glu Ser Cys Asn
340 345 350

Leu Gly Ser Ile Asn Leu Ala Arg Met Leu Ala Asp Gly Arg Val Asp
355 360 365

Trp Asp Arg Leu Glu Glu Val Ala Gly Val Ala Val Arg Phe Leu Asp
370 375 380

Asp Val Ile Asp Val Ser Arg Tyr Pro Phe Pro Glu Leu Gly Glu Ala
385 390 395 400

Ala Arg Ala Thr Arg Lys Ile Gly Leu Gly Val Met Gly Leu Ala Glu
405 410 415

Leu Leu Ala Ala Leu Gly Ile Pro Tyr Asp Ser Glu Glu Ala Val Arg
420 425 430

Leu Ala Thr Arg Leu Met Arg Arg Ile Gln Gln Ala Ala His Thr Ala
435 440 445

Ser Arg Arg Leu Ala Glu Glu Arg Gly Ala Phe Pro Ala Phe Thr Asp
450 455 460

Ser Arg Phe Ala Arg Ser Gly Pro Arg Arg Asn Ala Gln Val Thr Ser
465 470 475 480

Val Ala Pro Thr Gly
485



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

Gly Val Ile Val Leu Asp Leu Glu Pro Arg Gly Pro Leu Pro Thr Glu
1 5 10 15

Ile Tyr Trp Arg Arg Arg Gly Leu Ala Leu Gly Ile Ala Val Val Val
20 25 30

Val Gly Ile Ala Val Ala Ile Val Ile Ala Phe Val Asp Ser Ser Ala
35 40 45

Gly Ala Lys Pro Val Ser Ala Asp Lys Pro Ala Ser Ala Gln Ser His
50 55 60

Pro Gly Ser Pro Ala Pro Gln Ala Pro Gln Pro Ala Gly Gln Thr Glu
65 70 75 80

Gly Asn Ala Ala Ala Ala Pro Pro Gln Gly Gln Asn Pro Glu Thr Pro
85 90 95

Thr Pro Thr Ala Ala Val Gln Pro Pro Pro Val Leu Lys Glu Gly Asp
100 105 110

Asp Cys Pro Asp Ser Thr Leu Ala Val Lys Gly Leu Thr Asn Ala Pro
115 120 125

Gln Tyr Tyr Val Gly Asp Gln Pro Lys Phe Thr Met Val Val Thr Asn
130 135 140

Ile Gly Leu Val Ser Cys Lys Arg Asp Val Gly Ala Ala Val Leu Ala
145 150 155 160

Ala Tyr Val Tyr Ser Leu Asp Asn Lys Arg Leu Trp Ser Asn Leu Asp
165 170 175

Cys Ala Pro Ser Asn Glu Thr Leu Val Lys Thr Phe Ser Pro Gly Glu
180 185 190

Gln Val Thr Thr Ala Val Thr Trp Thr Gly Met Gly Ser Ala Pro Arg
195 200 205

Cys Pro Leu Pro Arg Pro Ala Ile Gly Pro Gly Thr Tyr Asn Leu Val
210 215 220

Val Gln Leu Gly Asn Leu Arg Ser Leu Pro Val Pro Phe Ile Leu Asn
225 230 235 240

Gln Pro Pro Pro Pro Pro Gly Pro Val Pro Ala Pro Gly Pro Ala Gln
245 250 255

Ala Pro Pro Pro Glu Ser Pro Ala Gln Gly Gly
260 265



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 97 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

Leu Ile Ser Thr Gly Lys Ala Ser His Ala Ser Leu Gly Val Gln Val
1 5 10 15

Thr Asn Asp Lys Asp Thr Pro Gly Ala Lys Ile Val Glu Val Val Ala
20 25 30

Gly Gly Ala Ala Ala Asn Ala Gly Val Pro Lys Gly Val Val Val Thr
35 40 45

Lys Val Asp Asp Arg Pro Ile Asn Ser Ala Asp Ala Leu Val Ala Ala
50 55 60

Val Arg Ser Lys Ala Pro Gly Ala Thr Val Ala Leu Thr Phe Gln Asp
65 70 75 80

Pro Ser Gly Gly Ser Arg Thr Val Gln Val Thr Leu Gly Lys Ala Glu
85 90 95

Gln

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 364 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

Gly Ala Ala Val Ser Leu Leu Ala Ala Gly Thr Leu Val Leu Thr Ala
1 5 10 15

Cys Gly Gly Gly Thr Asn Ser Ser Ser Ser Gly Ala Gly Gly Thr Ser
20 25 30

Gly Ser Val His Cys Gly Gly Lys Lys Glu Leu His Ser Ser Gly Ser
35 40 45

Thr Ala Gln Glu Asn Ala Met Glu Gln Phe Val Tyr Ala Tyr Val Arg
50 55 60

Ser Cys Pro Gly Tyr Thr Leu Asp Tyr Asn Ala Asn Gly Ser Gly Ala
65 70 75 80

Gly Val Thr Gln Phe Leu Asn Asn Glu Thr Asp Phe Ala Gly Ser Asp
85 90 95

Val Pro Leu Asn Pro Ser Thr Gly Gln Pro Asp Arg Ser Ala Glu Arg
100 105 110

Cys Gly Ser Pro Ala Trp Asp Leu Pro Thr Val Phe Gly Pro Ile Ala
115 120 125

Ile Thr Tyr Asn Ile Lys Gly Val Ser Thr Leu Asn Leu Asp Gly Pro
130 135 140

Thr Thr Ala Lys Ile Phe Asn Gly Thr Ile Thr Val Trp Asn Asp Pro
145 150 155 160

Gln Ile Gln Ala Leu Asn Ser Gly Thr Asp Leu Pro Pro Thr Pro Ile
165 170 175

Ser Val Ile Phe Arg Ser Asp Lys Ser Gly Thr Ser Asp Asn Phe Gln
180 185 190

Lys Tyr Leu Asp Gly Val Ser Asn Gly Ala Trp Gly Lys Gly Ala Ser
195 200 205

Glu Thr Phe Ser Gly Gly Val Gly Val Gly Ala Ser Gly Asn Asn Gly
210 215 220

Thr Ser Ala Leu Leu Gln Thr Thr Asp Gly Ser Ile Thr Tyr Asn Glu
225 230 235 240

Trp Ser Phe Ala Val Gly Lys Gln Leu Asn Met Ala Gln Ile Ile Thr
245 250 255

Ser Ala Gly Pro Asp Pro Val Ala Ile Thr Thr Glu Ser Val Gly Lys
260 265 270

Thr Ile Ala Gly Ala Lys Ile Met Gly Gln Gly Asn Asp Leu Val Leu
275 280 285

Asp Thr Ser Ser Phe Tyr Arg Pro Thr Gln Pro Gly Ser Tyr Pro Ile
290 295 300

Val Leu Ala Thr Tyr Glu Ile Val Cys Ser Lys Tyr Pro Asp Ala Thr
305 310 315 320

Thr Gly Thr Ala Val Arg Ala Phe Met Gln Ala Ala Ile Gly Pro Gly
325 330 335

Gln Glu Gly Leu Asp Gln Tyr Gly Ser Ile Pro Leu Pro Lys Ser Phe
340 345 350

Gln Ala Lys Leu Ala Ala Ala Val Asn Ala Ile Ser
355 360

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

Gln Ala Ala Ala Gly Arg Ala Val Arg Arg Thr Gly His Ala Glu Asp
1 5 10 15

Gln Thr His Gln Asp Arg Leu His His Gly Cys Arg Arg Ala Ala Val
20 25 30

Val Val Arg Gln Asp Arg Ala Ser Val Ser Ala Thr Ser Ala Arg Pro
35 40 45

Pro Arg Arg His Pro Ala Gln Gly His Arg Arg Arg Val Ala Pro Ser
50 55 60

Gly Gly Arg Arg Arg Pro His Pro His His Val Gln Pro Asp Asp Arg
65 70 75 80

Arg Asp Arg Pro Ala Leu Leu Asp Arg Thr Gln Pro Ala Glu His Pro
85 90 95

Asp Pro His Arg Arg Gly Pro Ala Asp Pro Gly Arg Val Arg Gly Arg
100 105 110

Gly Arg Leu Arg Arg Val Asp Asp Gly Arg Leu Gln Pro Asp Arg Asp
115 120 125

Ala Asp His Gly Ala Pro Val Arg Gly Arg Gly Pro His Arg Gly Val
130 135 140

Gln His Arg Gly Gly Pro Val Phe Val Arg Arg Val Pro Gly Val Arg
145 150 155 160

Cys Ala His Arg Arg Gly His Arg Arg Val Ala Ala Pro Gly Gln Gly
165 170 175

Asp Val Leu Arg Ala Gly Leu Arg Val Glu Arg Leu Arg Pro Val Ala
180 185 190

Ala Val Glu Asn Leu His Arg Gly Ser Gln Arg Ala Asp Gly Arg Val
195 200 205

Phe Arg Pro Ile Arg Arg Gly Ala Arg Leu Pro Ala Arg Arg Ser Arg
210 215 220

Ala Gly Pro Gln Gly Arg Leu His Leu Asp Gly Ala Gly Pro Ser Pro
225 230 235 240

Leu Pro Ala Arg Ala Gly Gln Gln Gln Pro Ser Ser Ala Gly Gly Arg
245 250 255

Arg Ala Gly Gly Ala Glu Arg Ala Asp Pro Gly Gln Arg Gly Arg His
260 265 270

His Gln Gly Gly His Asp Pro Gly Arg Gln Gly Ala Gln Arg Gly Thr
275 280 285

Ala Gly Val Ala His Ala Ala Ala Gly Pro Arg Arg Ala Ala Val Arg
290 295 300

Asn Arg Pro Arg Arg
305

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 580 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:

Ser Ala Val Trp Cys Leu Asn Gly Phe Thr Gly Arg His Arg His Gly
1 5 10 15

Arg Cys Arg Val Arg Ala Ser Gly Trp Arg Ser Ser Asn Arg Trp Cys
20 25 30

Ser Thr Thr Ala Asp Cys Cys Ala Ser Lys Thr Pro Thr Gln Ala Ala
35 40 45

Ser Pro Leu Glu Arg Arg Phe Thr Cys Cys Ser Pro Ala Val Gly Cys
50 55 60

Arg Phe Arg Ser Phe Pro Val Arg Arg Leu Ala Leu Gly Ala Arg Thr
65 70 75 80

Ser Arg Thr Leu Gly Val Arg Arg Thr Leu Ser Gln Trp Asn Leu Ser
85 90 95

Pro Arg Ala Gln Pro Ser Cys Ala Val Thr Val Glu Ser His Thr His
100 105 110

Ala Ser Pro Arg Met Ala Lys Leu Ala Arg Val Val Gly Leu Val Gln
115 120 125

Glu Glu Gln Pro Ser Asp Met Thr Asn His Pro Arg Tyr Ser Pro Pro
130 135 140

Pro Gln Gln Pro Gly Thr Pro Gly Tyr Ala Gln Gly Gln Gln Gln Thr
145 150 155 160

Tyr Ser Gln Gln Phe Asp Trp Arg Tyr Pro Pro Ser Pro Pro Pro Gln
165 170 175

Pro Thr Gln Tyr Arg Gln Pro Tyr Glu Ala Leu Gly Gly Thr Arg Pro
180 185 190

Gly Leu Ile Pro Gly Val Ile Pro Thr Met Thr Pro Pro Pro Gly Met
195 200 205

Val Arg Gln Arg Pro Arg Ala Gly Met Leu Ala Ile Gly Ala Val Thr
210 215 220

Ile Ala Val Val Ser Ala Gly Ile Gly Gly Ala Ala Ala Ser Leu Val
225 230 235 240

Gly Phe Asn Arg Ala Pro Ala Gly Pro Ser Gly Gly Pro Val Ala Ala
245 250 255

Ser Ala Ala Pro Ser Ile Pro Ala Ala Asn Met Pro Pro Gly Ser Val
260 265 270

Glu Gln Val Ala Ala Lys Val Val Pro Ser Val Val Met Leu Glu Thr
275 280 285

Asp Leu Gly Arg Gln Ser Glu Glu Gly Ser Gly Ile Ile Leu Ser Ala
290 295 300

Glu Gly Leu Ile Leu Thr Asn Asn His Val Ile Ala Ala Ala Ala Lys
305 310 315 320

Pro Pro Leu Gly Ser Pro Pro Pro Lys Thr Thr Val Thr Phe Ser Asp
325 330 335

Gly Arg Thr Ala Pro Phe Thr Val Val Gly Ala Asp Pro Thr Ser Asp
340 345 350

Ile Ala Val Val Arg Val Gln Gly Val Ser Gly Leu Thr Pro Ile Ser
355 360 365

Leu Gly Ser Ser Ser Asp Leu Arg Val Gly Gln Pro Val Leu Ala Ile
370 375 380

Gly Ser Pro Leu Gly Leu Glu Gly Thr Val Thr Thr Gly Ile Val Ser
385 390 395 400

Ala Leu Asn Arg Pro Val Ser Thr Thr Gly Glu Ala Gly Asn Gln Asn
405 410 415

Thr Val Leu Asp Ala Ile Gln Thr Asp Ala Ala Ile Asn Pro Gly Asn
420 425 430

Ser Gly Gly Ala Leu Val Asn Met Asn Ala Gln Leu Val Gly Val Asn
435 440 445

Ser Ala Ile Ala Thr Leu Gly Ala Asp Ser Ala Asp Ala Gln Ser Gly
450 455 460

Ser Ile Gly Leu Gly Phe Ala Ile Pro Val Asp Gln Ala Lys Arg Ile
465 470 475 480

Ala Asp Glu Leu Ile Ser Thr Gly Lys Ala Ser His Ala Ser Leu Gly
485 490 495

Val Gln Val Thr Asn Asp Lys Asp Thr Pro Gly Ala Lys Ile Val Glu
500 505 510

Val Val Ala Gly Gly Ala Ala Ala Asn Ala Gly Val Pro Lys Gly Val
515 520 525

Val Val Thr Lys Val Asp Asp Arg Pro Ile Asn Ser Ala Asp Ala Leu
530 535 540

Val Ala Ala Val Arg Ser Lys Ala Pro Gly Ala Thr Val Ala Leu Thr
545 550 555 560

Phe Gln Asp Pro Ser Gly Gly Ser Arg Thr Val Gln Val Thr Leu Gly
565 570 575

Lys Ala Glu Gln
580



(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 233 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

Met Asn Asp Gly Lys Arg Ala Val Thr Ser Ala Val Leu Val Val Leu
1 5 10 15

Gly Ala Cys Leu Ala Leu Trp Leu Ser Gly Cys Ser Ser Pro Lys Pro
20 25 30

Asp Ala Glu Glu Gln Gly Val Pro Val Ser Pro Thr Ala Ser Asp Pro
35 40 45

Ala Leu Leu Ala Glu Ile Arg Gln Ser Leu Asp Ala Thr Lys Gly Leu
50 55 60

Thr Ser Val His Val Ala Val Arg Thr Thr Gly Lys Val Asp Ser Leu
65 70 75 80

Leu Gly Ile Thr Ser Ala Asp Val Asp Val Arg Ala Asn Pro Leu Ala
85 90 95

Ala Lys Gly Val Cys Thr Tyr Asn Asp Glu Gln Gly Val Pro Phe Arg
100 105 110

Val Gln Gly Asp Asn Ile Ser Val Lys Leu Phe Asp Asp Trp Ser Asn
115 120 125

Leu Gly Ser Ile Ser Glu Leu Ser Thr Ser Arg Val Leu Asp Pro Ala
130 135 140

Ala Gly Val Thr Gln Leu Leu Ser Gly Val Thr Asn Leu Gln Ala Gln
145 150 155 160

Gly Thr Glu Val Ile Asp Gly Ile Ser Thr Thr Lys Ile Thr Gly Thr
165 170 175

Ile Pro Ala Ser Ser Val Lys Met Leu Asp Pro Gly Ala Lys Ser Ala
180 185 190

Arg Pro Ala Thr Val Trp Ile Ala Gln Asp Gly Ser His His Leu Val
195 200 205

Arg Ala Ser Ile Asp Leu Gly Ser Gly Ser Ile Gln Leu Thr Gln Ser
210 215 220

Lys Trp Asn Glu Pro Val Asn Val Asp
225 230

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Val Ile Asp Ile Ile Gly Thr Ser Pro Thr Ser Trp Glu Gln Ala Ala
1 5 10 15

Ala Glu Ala Val Gln Arg Ala Arg Asp Ser Val Asp Asp Ile Arg Val
20 25 30

Ala Arg Val Ile Glu Gln Asp Met Ala Val Asp Ser Ala Gly Lys Ile
35 40 45

Thr Tyr Arg Ile Lys Leu Glu Val Ser Phe Lys Met Arg Pro Ala Gln
50 55 60

Pro Arg
65

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 69 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

Val Pro Pro Ala Pro Pro Leu Pro Pro Leu Pro Pro Ser Pro Ile Ser
1 5 10 15

Cys Ala Ser Pro Pro Ser Pro Pro Leu Pro Pro Ala Pro Pro Val Ala
20 25 30

Pro Gly Pro Pro Met Pro Pro Leu Asp Pro Trp Pro Pro Ala Pro Pro
35 40 45

Leu Pro Tyr Ser Thr Pro Pro Gly Ala Pro Leu Pro Pro Ser Pro Pro
50 55 60

Ser Pro Pro Leu Pro
65

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:

Met Ser Asn Ser Arg Arg Arg Ser Leu Arg Trp Ser Trp Leu Leu Ser
1 5 10 15

Val Leu Ala Ala Val Gly Leu Gly Leu Ala Thr Ala Pro Ala Gln Ala
20 25 30

Ala Pro Pro Ala Leu Ser Gln Asp Arg Phe Ala Asp Phe Pro Ala Leu
35 40 45

Pro Leu Asp Pro Ser Ala Met Val Ala Gln Val Ala Pro Gln Val Val
50 55 60

Asn Ile Asn Thr Lys Leu Gly Tyr Asn Asn Ala Val Gly Ala Gly Thr
65 70 75 80

Gly Ile Val Ile Asp Pro Asn Gly Val Val Leu Thr Asn Asn His Val
85 90 95

Ile Ala Gly Ala Thr Asp Ile Asn Ala Phe Ser Val Gly Ser Gly Gln
100 105 110

Thr Tyr Gly Val Asp Val Val Gly Tyr Asp Arg Thr Gln Asp Val Ala
115 120 125

Val Leu Gln Leu Arg Gly Ala Gly Gly Leu Pro Ser Ala Ala Ile Gly
130 135 140

Gly Gly Val Ala Val Gly Glu Pro Val Val Ala Met Gly Asn Ser Gly
145 150 155 160

Gly Gln Gly Gly Thr Pro Arg Ala Val Pro Gly Arg Val Val Ala Leu
165 170 175

Gly Gln Thr Val Gln Ala Ser Asp Ser Leu Thr Gly Ala Glu Glu Thr
180 185 190

Leu Asn Gly Leu Ile Gln Phe Asp Ala Ala Ile Gln Pro Gly Asp Ser
195 200 205

Gly Gly Pro Val Val Asn Gly Leu Gly Gln Val Val Gly Met Asn Thr
210 215 220

Ala Ala Ser Asp Asn Phe Gln Leu Ser Gln Gly Gly Gln Gly Phe Ala
225 230 235 240

Ile Pro Ile Gly Gln Ala Met Ala Ile Ala Gly Gln Ile Arg Ser Gly
245 250 255

Gly Gly Ser Pro Thr Val His Ile Gly Pro Thr Ala Phe Leu Gly Leu
260 265 270

Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val Gln Arg Val Val
275 280 285

Gly Ser Ala Pro Ala Ala Ser Leu Gly Ile Ser Thr Gly Asp Val Ile
290 295 300

Thr Ala Val Asp Gly Ala Pro Ile Asn Ser Ala Thr Ala Met Ala Asp
305 310 315 320

Ala Leu Asn Gly His His Pro Gly Asp Val Ile Ser Val Asn Trp Gln
325 330 335

Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr Leu Ala Glu Gly
340 345 350

Pro Pro Ala
355



(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

Ser Pro Lys Pro Asp Ala Glu Glu Gln Gly Val Pro Val Ser Pro Thr
1 5 10 15

Ala Ser Asp Pro Ala Leu Leu Ala Glu Ile Arg Gln Ser Leu Asp Ala
20 25 30

Thr Lys Gly Leu Thr Ser Val His Val Ala Val Arg Thr Thr Gly Lys
35 40 45

Val Asp Ser Leu Leu Gly Ile Thr Ser Ala Asp Val Asp Val Arg Ala
50 55 60

Asn Pro Leu Ala Ala Lys Gly Val Cys Thr Tyr Asn Asp Glu Gln Gly
65 70 75 80

Val Pro Phe Arg Val Gln Gly Asp Asn Ile Ser Val Lys Leu Phe Asp
85 90 95

Asp Trp Ser Asn Leu Gly Ser Ile Ser Glu Leu Ser Thr Ser Arg Val
100 105 110

Leu Asp Pro Ala Ala Gly Val Thr Gln Leu Leu Ser Gly Val Thr Asn
115 120 125

Leu Gln Ala Gln Gly Thr Glu Val Ile Asp Gly Ile Ser Thr Thr Lys
130 135 140

Ile Thr Gly Thr Ile Pro Ala Ser Ser Val Lys Met Leu Asp Pro Gly
145 150 155 160

Ala Lys Ser Ala Arg Pro Ala Thr Val Trp Ile Ala Gln Asp Gly Ser
165 170 175

His His Leu Val Arg Ala Ser Ile Asp Leu Gly Ser Gly Ser Ile Gln
180 185 190

Leu Thr Gln Ser Lys Trp Asn Glu Pro Val Asn Val Asp
195 200 205







(2) INFORMATION FOR SEQ ID NO: 81: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

Gly Asp Ser Phe Trp Ala Ala Ala Asp Gln Met Ala Arg Gly Phe Val
1 5 10 15

Leu Gly Ala Thr Ala Gly Arg Thr Thr Leu Thr Gly Glu Gly Leu Gln
20 25 30

His Ala Asp Gly His Ser Leu Leu Leu Asp Ala Thr Asn Pro Ala Val
35 40 45

Val Ala Tyr Asp Pro Ala Phe Ala Tyr Glu Ile Gly Tyr Ile Xaa Glu
50 55 60

Ser Gly Leu Ala Arg Met Cys Gly Glu Asn Pro Glu Asn Ile Phe Phe
65 70 75 80

Tyr Ile Thr Val Tyr Asn Glu Pro Tyr Val Gln Pro Pro Glu Pro Glu
85 90 95

Asn Phe Asp Pro Glu Gly Val Leu Gly Gly Ile Tyr Arg Tyr His Ala
100 105 110

Ala Thr Glu Gln Arg Thr Asn Lys Xaa Gln Ile Leu Ala Ser Gly Val
115 120 125

Ala Met Pro Ala Ala Leu Arg Ala Ala Gln Met Leu Ala Ala Glu Trp
130 135 140

Asp Val Ala Ala Asp Val Trp Ser Val Thr Ser Trp Gly Glu Leu Asn
145 150 155 160

Arg Asp Gly Val Val Ile Glu Thr Glu Lys Leu Arg His Pro Asp Arg
165 170 175

Pro Ala Gly Val Pro Tyr Val Thr Arg Ala Leu Glu Asn Ala Arg Gly
180 185 190

Pro Val Ile Ala Val Ser Asp Trp Met Arg Ala Val Pro Glu Gln Ile
195 200 205

Arg Pro Trp Val Pro Gly Thr Tyr Leu Thr Leu Gly Thr Asp Gly Phe
210 215 220

Gly Phe Ser Asp Thr Arg Pro Ala Gly Arg Arg Tyr Phe Asn Thr Asp
225 230 235 240

Ala Glu Ser Gln Val Gly Arg Gly Phe Gly Arg Gly Trp Pro Gly Arg
245 250 255

Arg Val Asn Ile Asp Pro Phe Gly Ala Gly Arg Gly Pro Pro Ala Gln
260 265 270

Leu Pro Gly Phe Asp Glu Gly Gly Gly Leu Arg Pro Xaa Lys
275 280 285


(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Thr Lys Phe His Ala Leu Met Gln Glu Gln Ile His Asn Glu Phe Thr
1 5 10 15

Ala Ala Gln Gln Tyr Val Ala Ile Ala Val Tyr Phe Asp Ser Glu Asp
20 25 30

Leu Pro Gln Leu Ala Lys His Phe Tyr Ser Gln Ala Val Glu Glu Arg
35 40 45

Asn His Ala Met Met Leu Val Gln His Leu Leu Asp Arg Asp Leu Arg
50 55 60

Val Glu Ile Pro Gly Val Asp Thr Val Arg Asn Gln Phe Asp Arg Pro
65 70 75 80

Arg Glu Ala Leu Ala Leu Ala Leu Asp Gln Glu Arg Thr Val Thr Asp
85 90 95

Gln Val Gly Arg Leu Thr Ala Val Ala Arg Asp Glu Gly Asp Phe Leu
100 105 110

Gly Glu Gln Phe Met Gln Trp Phe Leu Gln Glu Gln Ile Glu Glu Val
115 120 125

Ala Leu Met Ala Thr Leu Val Arg Val Ala Asp Arg Ala Gly Ala Asn
130 135 140

Leu Phe Glu Leu Glu Asn Phe Val Ala Arg Glu Val Asp Val Ala Pro
145 150 155 160

Ala Ala Ser Gly Ala Pro His Ala Ala Gly Gly Arg Leu
165 170

(2) INFORMATION FOR SEQ ID NO: 83:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 107 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

Arg Ala Asp Glu Arg Lys Asn Thr Thr Met Lys Met Val Lys Ser Ile
1 5 10 15

Ala Ala Gly Leu Thr Ala Ala Ala Ala Ile Gly Ala Ala Ala Ala Gly
20 25 30

Val Thr Ser Ile Met Ala Gly Gly Pro Val Val Tyr Gln Met Gln Pro
35 40 45

Val Val Phe Gly Ala Pro Leu Pro Leu Asp Pro Xaa Ser Ala Pro Xaa
50 55 60

Val Pro Thr Ala Ala Gln Trp Thr Xaa Leu Leu Asn Xaa Leu Xaa Asp
65 70 75 80

Pro Asn Val Ser Phe Xaa Asn Lys Gly Ser Leu Val Glu Gly Gly Ile
85 90 95

Gly Gly Xaa Glu Gly Xaa Xaa Arg Arg Xaa Gln
100 105



(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Val Leu Ser Val Pro Val Gly Asp Gly Phe Trp Xaa Arg Val Val Asn
1 5 10 15

Pro Leu Gly Gln Pro Ile Asp Gly Arg Gly Asp Val Asp Ser Asp Thr
20 25 30

Arg Arg Ala Leu Glu Leu Gln Ala Pro Ser Val Val Xaa Arg Gln Gly
35 40 45

Val Lys Glu Pro Leu Xaa Thr Gly Ile Lys Ala Ile Asp Ala Met Thr
50 55 60

Pro Ile Gly Arg Gly Gln Arg Gln Leu Ile Ile Gly Asp Arg Lys Thr
65 70 75 80

Gly Lys Asn Arg Arg Leu Cys Arg Thr Pro Ser Ser Asn Gln Arg Glu
85 90 95

Glu Leu Gly Val Arg Trp Ile Pro Arg Ser Arg Cys Ala Cys Val Tyr
100 105 110

Val Gly His Arg Ala Arg Arg Gly Thr Tyr His Arg Arg
115 120 125





(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 117 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:

Cys Asp Ala Val Met Gly Phe Leu Gly Gly Ala Gly Pro Leu Ala Val
1 5 10 15

Val Asp Gln Gln Leu Val Thr Arg Val Pro Gln Gly Trp Ser Phe Ala
20 25 30

Gln Ala Ala Ala Val Pro Val Val Phe Leu Thr Ala Trp Tyr Gly Leu
35 40 45

Ala Asp Leu Ala Glu Ile Lys Ala Gly Glu Ser Val Leu Ile His Ala
50 55 60

Gly Thr Gly Gly Val Gly Met Ala Ala Val Gln Leu Ala Arg Gln Trp
65 70 75 80

Gly Val Glu Val Phe Val Thr Ala Ser Arg Gly Lys Trp Asp Thr Leu
85 90 95

Arg Ala Xaa Xaa Phe Asp Asp Xaa Pro Tyr Arg Xaa Phe Pro His Xaa
100 105 110

Arg Ser Ser Xaa Gly
115

(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 103 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

Met Tyr Arg Phe Ala Cys Arg Thr Leu Met Leu Ala Ala Cys Ile Leu
1 5 10 15

Ala Thr Gly Val Ala Gly Leu Gly Val Gly Ala Gln Ser Ala Ala Gln
20 25 30

Thr Ala Pro Val Pro Asp Tyr Tyr Trp Cys Pro Gly Gln Pro Phe Asp
35 40 45

Pro Ala Trp Gly Pro Asn Trp Asp Pro Tyr Thr Cys His Asp Asp Phe
50 55 60

His Arg Asp Ser Asp Gly Pro Asp His Ser Arg Asp Tyr Pro Gly Pro
65 70 75 80

Ile Leu Glu Gly Pro Val Leu Asp Asp Pro Gly Ala Ala Pro Pro Pro
85 90 95

Pro Ala Ala Gly Gly Gly Ala
100

(2) INFORMATION FOR SEQ ID NO: 87:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 88 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

Val Gln Cys Arg Val Trp Leu Glu Ile Gln Trp Arg Gly Met Leu Gly
1 5 10 15

Ala Asp Gln Ala Arg Ala Gly Gly Pro Ala Arg Ile Trp Arg Glu His
20 25 30

Ser Met Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala
35 40 45

Thr Lys Glu Gly Arg Gly Ile Val Met Arg Val Pro Leu Glu Gly Gly
50 55 60

Gly Arg Leu Val Val Glu Leu Thr Pro Asp Glu Ala Ala Ala Leu Gly
65 70 75 80

Asp Glu Leu Lys Gly Val Thr Ser
85

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Thr Asp Ala Ala Thr Leu Ala Gln Glu Ala Gly Asn Phe Glu Arg Ile
1 5 10 15

Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly
20 25 30

Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln Ala
35 40 45

Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu
50 55 60

Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg
65 70 75 80

Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
85 90 95

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

Met Thr Gln Ser Gln Thr Val Thr Val Asp Gln Gln Glu Ile Leu Asn
1 5 10 15

Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val
20 25 30

Pro Ile Thr Pro Cys Glu Leu Thr Xaa Xaa Lys Asn Ala Ala Gln Gln
35 40 45

Xaa Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala
50 55 60

Lys Glu Arg Gln Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Xaa
65 70 75 80

Tyr Gly Glu Val Asp Glu Glu Ala Ala Thr Ala Leu Asp Asn Asp Gly
85 90 95

Glu Gly Thr Val Gln Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser
100 105 110

Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro
115 120 125

Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp
130 135 140

Gln Gly Ala Ser Leu Ala His Xaa Gly Asp Gly Trp Asn Thr Xaa Thr
145 150 155 160

Leu Thr Leu Gln Gly Asp
165



(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Arg Ala Glu Arg Met 
1 5 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:

Val Ala Trp Met Ser Val Thr Ala Gly Gln Ala Glu Leu Thr Ala Ala
1 5 10 15

Gln Val Arg Val Ala Ala Ala Ala Tyr Glu Thr Ala Tyr Gly Leu Thr
20 25 30

Val Pro Pro Pro Val Ile Ala Glu Asn Arg Ala Glu Leu Met Ile Leu
35 40 45

Ile Ala Thr Asn Leu Leu Gly Gln Asn Thr Pro Ala Ile Ala Val Asn
50 55 60

Glu Ala Glu Tyr Gly Glu Met Trp Ala Gln Asp Ala Ala Ala Met Phe
65 70 75 80

Gly Tyr Ala Ala Ala Thr Ala Thr Ala Thr Ala Thr Leu Leu Pro Phe
85 90 95

Glu Glu Ala Pro Glu Met Thr Ser Ala Gly Gly Leu Leu Glu Gln Ala
100 105 110

Ala Ala Val Glu Glu Ala Ser Asp Thr Ala Ala Ala Asn Gln Leu Met
115 120 125

Asn Asn Val Pro Gln Ala Leu Lys Gln Leu Ala Gln Pro Thr Gln Gly
130 135 140

Thr Thr Pro Ser Ser Lys Leu Gly Gly Leu Trp Lys Thr Val Ser Pro
145 150 155 160

His Arg Ser Pro Ile Ser Asn Met Val Ser Met Ala Asn Asn His Met
165 170 175

Ser Met Thr Asn Ser Gly Val Ser Met Thr Asn Thr Leu Ser Ser Met
180 185 190

Leu Lys Gly Phe Ala Pro Ala Ala Ala Ala Gln Ala Val Gln Thr Ala
195 200 205

Ala Gln Asn Gly Val Arg Ala Met Ser Ser Leu Gly Ser Ser Leu Gly
210 215 220

Ser Ser Gly Leu Gly Gly Gly Val Ala Ala Asn Leu Gly Arg Ala Ala
225 230 235 240

Ser Val Arg Tyr Gly His Arg Asp Gly Gly Lys Tyr Ala Xaa Ser Gly
245 250 255

Arg Arg Asn Gly Gly Pro Ala
260

(2) INFORMATION FOR SEQ ID NO: 92: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 303 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Met Thr Tyr Ser Pro Gly Asn Pro Gly Tyr Pro Gln Ala Gln Pro Ala
1 5 10 15

Gly Ser Tyr Gly Gly Val Thr Pro Ser Phe Ala His Ala Asp Glu Gly
20 25 30

Ala Ser Lys Leu Pro Met Tyr Leu Asn Ile Ala Val Ala Val Leu Gly
35 40 45

Leu Ala Ala Tyr Phe Ala Ser Phe Gly Pro Met Phe Thr Leu Ser Thr
50 55 60

Glu Leu Gly Gly Gly Asp Gly Ala Val Ser Gly Asp Thr Gly Leu Pro
65 70 75 80

Val Gly Val Ala Leu Leu Ala Ala Leu Leu Ala Gly Val Val Leu Val
85 90 95

Pro Lys Ala Lys Ser His Val Thr Val Val Ala Val Leu Gly Val Leu
100 105 110

Gly Val Phe Leu Met Val Ser Ala Thr Phe Asn Lys Pro Ser Ala Tyr
115 120 125

Ser Thr Gly Trp Ala Leu Trp Val Val Leu Ala Phe Ile Val Phe Gln
130 135 140

Ala Val Ala Ala Val Leu Ala Leu Leu Val Glu Thr Gly Ala Ile Thr
145 150 155 160

Ala Pro Ala Pro Arg Pro Lys Phe Asp Pro Tyr Gly Gln Tyr Gly Arg
165 170 175

Tyr Gly Gln Tyr Gly Gln Tyr Gly Val Gln Pro Gly Gly Tyr Tyr Gly
180 185 190

Gln Gln Gly Ala Gln Gln Ala Ala Gly Leu Gln Ser Pro Gly Pro Gln
195 200 205

Gln Ser Pro Gln Pro Pro Gly Tyr Gly Ser Gln Tyr Gly Gly Tyr Ser
210 215 220

Ser Ser Pro Ser Gln Ser Gly Ser Gly Tyr Thr Ala Gln Pro Pro Ala
225 230 235 240

Gln Pro Pro Ala Gln Ser Gly Ser Gln Gln Ser His Gln Gly Pro Ser
245 250 255

Thr Pro Pro Thr Gly Phe Pro Ser Phe Ser Pro Pro Pro Pro Val Ser
260 265 270

Ala Gly Thr Gly Ser Gln Ala Gly Ser Ala Pro Val Asn Tyr Ser Asn
275 280 285

Pro Ser Gly Gly Glu Gln Ser Ser Ser Pro Gly Gly Ala Pro Val
290 295 300

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

Gly Cys Gly Glu Thr Asp Ala Ala Thr Leu Ala Gln Glu Ala Gly Asn
1 5 10 15

Phe Glu Arg Ile Ser Gly Asp Leu Lys Thr Gln Ile
20 25



(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:

Asp Gln Val Glu Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly
1 5 10 15



(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

Gly Cys Gly Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala
1 5 10 15

Ala Gly Thr Ala Ala Gln Ala Ala Val Val Arg
20 25

(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

Gly Cys Gly Gly Thr Ala Ala Gln Ala Ala Val Val Arg Phe Gln Glu
1 5 10 15

Ala Ala Asn Lys Gln Lys Gln Glu Leu Asp Glu
20 25



(2) INFORMATION FOR SEQ ID NO: 97: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

Gly Cys Gly Ala Asn Lys Gln Lys Gln Glu Leu Asp Glu Ile Ser Thr
1 5 10 15

Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg
20 25

(2) INFORMATION FOR SEQ ID NO: 98: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

Gly Cys Gly Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu
1 5 10 15

Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
20 25

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

ATGAAGATGG TGAAATCGAT CGCCGCAGGT CTGACCGCCG CGGCTGCAAT CGGCGCCGCT 60

GCGGCCGGTG TGACTTCGAT CATGGCTGGC GGCCCGGTCG TATACCAGAT GCAGCCGGTC 120

GTCTTCGGCG CGCCACTGCC GTTGGACCCG GCATCCGCCC CTGACGTCCC GACCGCCGCC 180

CAGTTGACCA GCCTGCTCAA CAGCCTCGCC GATCCCAACG TGTCGTTTGC GAACAAGGGC 240

AGTCTGGTCG AGGGCGGCAT CGGGGGCACC GAGGCGCGCA TCGCCGACCA CAAGCTGAAG 300

AAGGCCGCCG AGCACGGGGA TCTGCCGCTG TCGTTCAGCG TGACGAACAT CCAGCCGGCG 360

GCCGCCGGTT CGGCCACCGC CGACGTTTCC GTCTCGGGTC CGAAGCTCTC GTCGCCGGTC 420

ACGCAGAACG TCACGTTCGT GAATCAAGGC GGCTGGATGC TGTCACGCGC ATCGGCGATG 480

GAGTTGCTGC AGGCCGCAGG GAACTGA 507



(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 168 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:

Leu
100 105 110

Ser Val Thr Asn Ile Gln Pro Ala Ala Ala Gly Ser Ala Thr Ala Asp
115 120 125

Val Ser Val Ser Gly Pro Lys Leu Ser Ser Pro Val Thr Gln Asn Val
130 135 140

Thr Phe Val Asn Gln Gly Gly Trp Met Leu Ser Arg Ala Ser Ala Met
145 150 155 160

Glu Leu Leu Gln Ala Ala Gly Asn
165

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CGTGGCAATG TCGTTGACCG TCGGGGCCGG GGTCGCCTCC GCAGATCCCG TGGACGCGGT 60

CATTAACACC ACCTGCAATT ACGGGCAGGT AGTAGCTGCG CTCAACGCGA CGGATCCGGG 120

GGCTGCCGCA CAGTTCAACG CCTCACCGGT GGCGCAGTCC TATTTGCGCA ATTTCCTCGC 180

CGCACCGCCA CCTCAGCGCG CTGCCATGGC CGCGCAATTG CAAGCTGTGC CGGGGGCGGC 240

ACAGTACATC GGCCTTGTCG AGTCGGTTGC CGGCTCCTGC AACAACTATT AAGCCCATGC 300

GGGCCCCATC CCGCGACCCG GCATCGTCGC CGGGGCTAGG CCAGATTGCC CCGCTCCTCA 360

ACGGGCCGCA TCCCGCGACC CGGCATCGTC GCCGGGGCTA GGCCAGATTG CCCCGCTCCT 420

CAACGGGCCG CATCTCGTGC CGAATTCCTG CAGCCCGGGG GATCCACTAG TTCTAGAGCG 480

GCCGCCACCG CGGTGGAGCT 500

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:

Val Ala Met Ser Leu Thr Val Gly Ala Gly Val Ala Ser Ala Asp Pro
1 5 10 15

Val Asp Ala Val Ile Asn Thr Thr Cys Asn Tyr Gly Gln Val Val Ala
20 25 30

Ala Leu Asn Ala Thr Asp Pro Gly Ala Ala Ala Gln Phe Asn Ala Ser
35 40 45

Pro Val Ala Gln Ser Tyr Leu Arg Asn Phe Leu Ala Ala Pro Pro Pro
50 55 60

Gln Arg Ala Ala Met Ala Ala Gln Leu Gln Ala Val Pro Gly Ala Ala
65 70 75 80

Gln Tyr Ile Gly Leu Val Glu Ser Val Ala Gly Ser Cys Asn Asn Tyr
85 90 95



(2) INFORMATION FOR SEQ ID NO: 103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:103: 

ATGACAGAGC AGCAGTGGAA TTTCGCGGGT ATCGAGGCCG CGGCAAGCGC AATCCAGGGA 
AATGTCACGT CCATTCATTC CCTCCTTGAC GAGGGGAAGC AGTCCCTGAC CAAGCTCGCA 
GCGGCCTGGG GCGGTAGCGG TTCGGAAGCG TACC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

Met Thr Glu Gln Gln Trp Asn Phe Ala Gly Ile Glu Ala Ala Ala Ser
1 5 10 15

Ala Ile Gln Gly Asn Val Thr Ser Ile His Ser Leu Leu Asp Glu Gly
20 25 30

Lys Gln Ser Leu Thr Lys Leu Ala Ala Ala Trp Gly Gly Ser Gly Ser
35 40 45

Glu Ala Tyr
50

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CGGTCGCGCA CTTCCAGGTG ACTATGAAAG TCGGCTTCCG NCTGGAGGAT TCCTGAACCT
TCAAGCGCGG CCGATAACTG AGGTGCATCA TTAAGCGACT TTTCCAGAAC ATCCTGACGC 
GCTCGAAACG CGGCACAGCC GACGGTGGCT CCGNCGAGGC GCTGNCTCCA AAATCCCTGA 
GACAATTCGN CGGGGGCGCC TACAAGGAAG TCGGTGCTGA ATTCGNCGNG TATCTGGTCG 
ACCTGTGTGG TCTGNAGCCG GACGAAGCGG TGCTCGACGT CG 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

GATCGTACCC GTGCGAGTGC TCGGGCCGTT TGAGGATGGA GTGCACGTGT CTTTCGTGAT 60 

GGCATACCCA GAGATGTTGG CGGCGGCGGC TGACACCCTG CAGAGCATCG GTGCTACCAC 120

TGTGGCTAGC AATGCCGCTG CGGCGGCCCC GACGACTGGG GTGGTGCCCC CCGCTGCCGA 180 

TGAGGTGTCG GCGCTGACTG CGGCGCACTT CGCCGCACAT GCGGCGATGT ATCAGTCCGT 240 

GAGCGCTCGG GCTGCTGCGA TTCATGACCA GTTCGTGGCC ACCCTTGCCA GCAGCGCCAG 300 

CTCGTATGCG GCCACTGAAG TCGCCAATGC GGCGGCGGCC AGCTAAGCCA GGAACAGTCG 360 

GCACGAGAAA CCACGAGAAA TAGGGACACG TAATGGTGGA TTTCGGGGCG TTACCACCGG 420

AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC CTCGCTGGTG GCCGCGGCTC 480 

AGATGTGGGA CAGCGTGGCG AGTGACCTGT TTTCGGCCGC GTCGGCGTTT CAGTCGGTGG 540

TCTGGGGTCT GACGGTGGGG TCGTGGATAG GTTCGTCGGC GGGTCTGATG GTGGCGGCGG 600 

CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA GGCCGAGCTG ACCGCCGCCC 660 

AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG GCTGACGGTG CCCCCGCCGG 720 

TGATCGCCGA GAACCGTGCT GAACTGATGA TTCTGATAGC GACCAACCTC TTGGGGCAAA 780 

ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGCGA GATGTGGGCC CAAGACGCCG 840 

CCGCGATGTT TGGCTACGCC GCGGCGACGG CGACGGCGAC GGCGACGTTG CTGCCGTTCG 900 

AGGAGGCGCC GGAGATGACC AGCGCGGGTG GGCTCCTCGA GCAGGCCGCC GCGGTCGAGG 960 

AGGCCTCCGA CACCGCCGCG GCGAACCAGT TGATGAACAA TGTGCCCCAG GCGCTGCAAC 1020

AGCTGGCCCA GCCCACGCAG GGCACCACGC CTTCTTCCAA GCTGGGTGGC CTGTGGAAGA 1080 

CGGTCTCGCC GCATCGGTCG CCGATCAGCA ACATGGTGTC GATGGCCAAC AACCACATGT 1140

CGATGACCAA CTCGGGTGTG TCGATGACCA ACACCTTGAG CTCGATGTTG AAGGGCTTTG 1200 

CTCCGGCGGC GGCCGCCCAG GCCGTGCAAA CCGCGGCGCA AAACGGGGTC CGGGCGATGA 1260 

GCTCGCTGGG CAGCTCGCTG GGTTCTTCGG GTCTGGGCGG TGGGGTGGCC GCCAACTTGG 1320 

GTCGGGCGGC CTCGGTCGGT TCGTTGTCGG TGCCGCAGGC CTGGGCCGCG GCCAACCAGG 1380 

CAGTCACCCC GGCGGCGCGG GCGCTGCCGC TGACCAGCCT GACCAGCGCC GCGGAAAGAG 1440 

GGCCCGGGCA GATGCTGGGC GGGCTGCCGG TGGGGCAGAT GGGCGCCAGG GCCGGTGGTG 1500 

GGCTCAGTGG TGTGCTGCGT GTTCCGCCGC GACCCTATGT GATGCCGCAT TCTCCGGCGG 1560 

CCGGCTAGGA GAGGGGGCGC AGACTGTCGT TATTTGACCA GTGATCGGCG GTCTCGGTGT 1620 
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TTCCGCGGCC GGCTATGACA ACAGTCAATG TGCATGACAA GTTACAGGTA TTAGGTCCAG 1680 

GTTCAACAAG GAGACAGGCA ACATGGCCTC ACGTTTTATG ACGGATCCGC ACGCGATGCG 1740 

GGACATGGCG GGCCGTTTTG AGGTGCACGC CCAGACGGTG GAGGACGAGG CTCGCCGGAT 1800 

GTGGGCGTCC GCGCAAAACA TTTCCGGTGC GGGCTGGAGT GGCATGGCCG AGGCGACCTC 1860

GCTAGACACC ATGGCCCAGA TGAATCAGGC GTTTCGCAAC ATCGTGAACA TGCTGCACGG 1920

GGTGCGTGAC GGGCTGGTTC GCGAGGCCAA CAACTACGAG CAGCAAGAGC AGGCCTCCCA 1980 

GCAGATCCTC AGCAGCTAAC GTCAGCCGCT GCAGCACAAT ACTTTTACAA GCGAAGGAGA 2040 

ACAGGTTCGA TGAGGATGAA CTATGAATTC GGGGATGTCG ACGCTCACGG CGCCATGATC 2100 

CGCGCTCAGG CCGGGTTGCT GGAGGCCGAG CATCAGGCCA TCATTCGTGA TGTGTTGACC 2160 

GCGAGTGACT TTTGGGGCGG CGCCGGTTCG GCGGCCTGCC AGGGGTTCAT TACCCAGTTG 2220

GGCCGTAACT TCCAGGTGAT CTACGAGCAG GCCAACGCCC ACGGGCAGAA GGTGCAGGCT 2280

GCCGGCAACA ACATGGCGCA AACCGACAGC GCCGTCGGCT CCAGCTGGGC CTGACACCAG 2340

GCCAAGGCCA GGGACGTGGT GTACGAGTGA AGTTCCTCGC GTGATCCTTC GGGTGGCAGT 2400

CTAAGTGGTC AGTGCTGGGG TGTTGGTGGT TTGCTGCTTG GCGGGTTCTT CGGTGCTGGT 2460

CAGTGCTGCT CGGGCTCGGG TGAGGACCTC GAGGCCCAGG TAGCGCCGTC CTTCGATCCA 2520

TTCGTCGTGT TGTTCGGCGA GGACGGCTCC GACGAGGCGG ATGATCGAGG CGCGGTCGGG 2580

GAAGATGCCC ACGACGTCGG TTCGGCGTCG TACCTCTCGG TTGAGGCGTT CCTGGGGGTT 2640 

GTTGGACCAG ATTTGGCGCC AGATCTGCTT GGGGAAGGCG GTGAACGCCA GCAGGTCGGT 2700 

GCGGGCGGTG TCGAGGTGCT CGGCCACCGC GGGGAGTTTG TCGGTCAGAG CGTCGAGTAC 2760 

CCGATCATAT TGGGCAACAA CTGATTCGGC GTCGGGCTGG TCGTAGATGG AGTGCAGCAG 2820 

GGTGCGCACC CACGGCCAGG AGGGCTTCGG GGTGGCTGCC ATCAGATTGG CTGCGTAGTG 2880 

GGTTCTGCAG CGCTGCCAGG CCGCTGCGGG CAGGGTGGCG CCGATCGCGG CCACCAGGCC 2940

GGCGTGGGCG TCGCTGGTGA CCAGCGCGAC CCCGGACAGG CCGCGGGCGA CCAGGTCGCG 3000 

GAAGAACGCC AGCCAGCCGG CCCCGTCCTC GGCGGAGGTG ACCTGGATGC CCAGGATC 3058 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 391 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

Met Val Asp Phe Gly Ala Leu Pro Pro Glu Ile Asn Ser Ala Arg Met
1 5 10 15

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Gln Met Trp
20 25 30

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gln Ser
35 40 45

Val Val Trp Gly Leu Thr Val Gly Ser Trp Ile Gly Ser Ser Ala Gly
50 55 60

Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr
65 70 75 80

Ala Gly Gln Ala Glu Leu Thr Ala Ala Gln Val Arg Val Ala Ala Ala
85 90 95

Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val Ile Ala
100 105 110

Glu Asn Arg Ala Glu Leu Met Ile Leu Ile Ala Thr Asn Leu Leu Gly
115 120 125

Gln Asn Thr Pro Ala Ile Ala Val Asn Glu Ala Glu Tyr Gly Glu Met
130 135 140

Trp Ala Gln Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Ala Thr Ala
145 150 155 160

Thr Ala Thr Ala Thr Leu Leu Pro Phe Glu Glu Ala Pro Glu Met Thr
165 170 175

Ser Ala Gly Gly Leu Leu Glu Gln Ala Ala Ala Val Glu Glu Ala Ser
180 185 190

Asp Thr Ala Ala Ala Asn Gln Leu Met Asn Asn Val Pro Gln Ala Leu
195 200 205

Gln Gln Leu Ala Gln Pro Thr Gln Gly Thr Thr Pro Ser Ser Lys Leu
210 215 220

Gly Gly Leu Trp Lys Thr Val Ser Pro His Arg Ser Pro Ile Ser Asn
225 230 235 240

Met Val Ser Met Ala Asn Asn His Met Ser Met Thr Asn Ser Gly Val
245 250 255

Ser Met Thr Asn Thr Leu Ser Ser Met Leu Lys Gly Phe Ala Pro Ala
260 265 270

Ala Ala Ala Gln Ala Val Gln Thr Ala Ala Gln Asn Gly Val Arg Ala
275 280 285

Met Ser Ser Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu Gly Gly Gly
290 295 300

Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser Leu Ser Val
305 310 315 320

Pro Gln Ala Trp Ala Ala Ala Asn Gln Ala Val Thr Pro Ala Ala Arg
325 330 335

Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Glu Arg Gly Pro Gly
340 345 350

Gln Met Leu Gly Gly Leu Pro Val Gly Gln Met Gly Ala Arg Ala Gly
355 360 365

Gly Gly Leu Ser Gly Val Leu Arg Val Pro Pro Arg Pro Tyr Val Met
370 375 380

Pro His Ser Pro Ala Ala Gly
385 390

(2) INFORMATION FOR SEQ ID NO: 108:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1725 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

GACGTCAGCA CCCGCCGTGC AGGGCTGGAG CGTGGTCGGT TTTGATCTGC GGTCAAGGTG 60 

ACGTCCCTCG GCGTGTCGCC GGCGTGGATG CAGACTCGAT GCCGCTCTTT AGTGCAACTA 120 

ATTTCGTTGA AGTGCCTGCG AGGTATAGGA CTTCACGATT GGTTAATGTA GCGTTCACCC 180 

CGTGTTGGGG TCGATTTGGC CGGACCAGTC GTCACCAACG CTTGGCGTGC GCGCCAGGCG 240 

GGCGATCAGA TCGCTTGACT ACCAATCAAT CTTGAGCTCC CGGGCCGATG CTCGGGCTAA 300 

ATGAGGAGGA GCACGCGTGT CTTTCACTGC GCAACCGGAG ATGTTGGCGG CCGCGGCTGG 360 

CGAACTTCGT TCCCTGGGGG CAACGCTGAA GGCTAGCAAT GCCGCCGCAG CCGTGCCGAC 420 

GACTGGGGTG GTGCCCCCGG CTGCCGACGA GGTGTCGCTG CTGCTTGCCA CACAATTCCG 480 

TACGCATGCG GCGACGTATC AGACGGCCAG CGCCAAGGCC GCGGTGATCC ATGAGCAGTT 540 

TGTGACCACG CTGGCCACCA GCGCTAGTTC ATATGCGGAC ACCGAGGCCG CCAACGCTGT 600 

GGTCACCGGC TAGCTGACCT GACGGTATTC GAGCGGAAGG ATTATCGAAG TGGTGGATTT 660

CGGGGCGTTA CCACCGGAGA TCAACTCCGC GAGGATGTAC GCCGGCCCGG GTTCGGCCTC 720

GCTGGTGGCC GCCGCGAAGA TGTGGGACAG CGTGGCGAGT GACCTGTTTT CGGCCGCGTC 780 

GGCGTTTCAG TCGGTGGTCT GGGGTCTGAC GGTGGGGTCG TGGATAGGTT CGTCGGCGGG 840

TCTGATGGCG GCGGCGGCCT CGCCGTATGT GGCGTGGATG AGCGTCACCG CGGGGCAGGC 900 

CCAGCTGACC GCCGCCCAGG TCCGGGTTGC TGCGGCGGCC TACGAGACAG CGTATAGGCT 960 

GACGGTGCCC CCGCCGGTGA TCGCCGAGAA CCGTACCGAA CTGATGACGC TGACCGCGAC 1020 

CAACCTCTTG GGGCAAAACA CGCCGGCGAT CGAGGCCAAT CAGGCCGCAT ACAGCCAGAT 1080 

GTGGGGCCAA GACGCGGAGG CGATGTATGG CTACGCCGCC ACGGCGGCGA CGGCGACCGA 1140

GGCGTTGCTG CCGTTCGAGG ACGCCCCACT GATCACCAAC CCCGGCGGGC TCCTTGAGCA 1200 

GGCCGTCGCG GTCGAGGAGG CCATCGACAC CGCCGCGGCG AACCAGTTGA TGAACAATGT 1260 

GCCCCAAGCG CTGCAACAGC TGGCCCAGCC AGCGCAGGGC GTCGTACCTT CTTCCAAGCT 1320 

GGGTGGGCTG TGGACGGCGG TCTCGCCGCA TCTGTCGCCG CTCAGCAACG TCAGTTCGAT 1380

AGCCAACAAC CACATGTCGA TGATGGGCAC GGGTGTGTCG ATGACCAACA CCTTGCACTC 1440

GATGTTGAAG GGCTTAGCTC CGGCGGCGGC TCAGGCCGTG GAAACCGCGG CGGAAAACGG 1500 

GGTCTGGGCG ATGAGCTCGC TGGGCAGCCA GCTGGGTTCG TCGCTGGGTT CTTCGGGTCT 1560 

GGGCGCTGGG GTGGCCGCCA ACTTGGGTCG GGCGGCCTCG GTCGGTTCGT TGTCGGTGCC 1620 

GCCAGCATGG GCCGCGGCCA ACCAGGCGGT CACCCCGGCG GCGCGGGCGC TGCCGCTGAC 1680 

CAGCCTGACC AGCGCCGCCC AAACCGCCCC CGGACACATG CTGGG 1725 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:109: 

Val Val Asp Phe Gly Ala Leu Pro Pro Glu lie Asn Ser Ala Arg Met 
15 10 15 

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Lys Met Trp 
20 25 30 

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gin Ser 
35 40 45 
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Val Val Trp Gly Leu Thr Val Gly Ser Trp He Gly Ser Ser Ala Gly 
50 55 60 

Leu Met Ala Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr 
65 70 75 80 

Ala Gly Gin Ala Gin Leu Thr Ala Ala Gin Val Arg Val Ala Ala Ala 
85 90 95 

Ala Tyr Glu Thr Ala Tyr Arg Leu Thr Val Pro Pro Pro Val He Ala 
100 , 105 110 

Glu Asn Arg Thr Glu Leu Met Thr Leu Thr Ala Thr Asn Leu Leu Gly 
115 120 125 

Gin Asn Thr Pro Ala He Glu Ala Asn Gin Ala Ala Tyr Ser Gin Met 
130 135 140 

Trp Gly Gin Asp Ala Glu Ala Met Tyr Gly Tyr Ala Ala Thr Ala Ala 
145 150 155 160 

Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu He Thr 
165 170 175 

Asn Pro Gly Gly Leu Leu Glu Gin Ala Val Ala Val Glu Glu Ala He 
180 185 190 

Asp Thr Ala Ala Ala Asn Gin Leu Met Asn Asn Val Pro Gin Ala Leu 
195 200 205 

Gin Gin Leu Ala Gin Pro Ala Gin Gly Val Val Pro Ser Ser Lys Leu 
210 215 220 

Gly Gly Leu Trp Thr Ala Val Ser Pro His Leu Ser Pro Leu Ser Asn 
225 230 235 240 

Val Ser Ser He Ala Asn Asn His Met Ser Met Met Gly Thr Gly Val 
245 250 255 

Ser Met Thr Asn Thr Leu His Ser Met Leu Lys Gly Leu Ala Pro Ala 
260 265 270 

Ala Ala Gln Ala Val Glu Thr Ala Ala Glu Asn Gly Val Trp Ala Met
275 280 285

Ser Ser Leu Gly Ser Gln Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu
290 295 300

Gly Ala Gly Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser
305 310 315 320

Leu Ser Val Pro Pro Ala Trp Ala Ala Ala Asn Gln Ala Val Thr Pro
325 330 335

Ala Ala Arg Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Gln Thr
340 345 350

Ala Pro Gly His Met Leu Gly 
355 

(2) INFORMATION FOR SEQ ID NO: 110:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

AGTTCAGTCG AGAATGATAC TGACGGGCTG TATCCACGAT GGCTGAGACA ACCGAACCAC 60 

CGTCGGACGC GGGGACATCG CAAGCCGACG CGATGGCGTT GGCCGCCGAA GCCGAAGCCG 120 

CCGAAGCCGA AGCGCTGGCC GCCGCGGCGC GGGCCCGTGC CCGTGCCGCC CGGTTGAAGC 180 

GTGAGGCGCT GGCGATGGCC CCAGCCGAGG ACGAGAACGT CCCCGAGGAT ATGCAGACTG 240 

GGAAGACGCC GAAGACTATG ACGACTATGA CGACTATGAG GCCGCAGACC AGGAGGCCGC 300 

ACGGTCGGCA TCCTGGCGAC GGCGGTTGCG GGTGCGGTTA CCAAGACTGT CCACGATTGC 360 



CATGGCGGCC GCAGTCGTCA TCATCTGCGG CTTCACCGGG CTCAGCGGAT ACATTGTGTG 420

GCAACACCAT GAGGCCACCG AACGCCAGCA GCGCGCCGCG GCGTTCGCCG CCGGAGCCAA 480

GCAAGGTGTC ATCAACATGA CCTCGCTGGA CTTCAACAAG GCCAAAGAAG ACGTCGCGCG 540 

TGTGATCGAC AGCTCCACCG GCGAATTCAG GGATGACTTC CAGCAGCGGG CAGCCGATTT 600 

CACCAAGGTT GTCGAACAGT CCAAAGTGGT CACCGAAGGC ACGGTGAACG CGACAGCCGT 660 

CGAATCCATG AACGAGCATT CCGCCGTGGT GCTCGTCGCG GCGACTTCAC GGGTCACCAA 720 

TTCCGCTGGG GCGAAAGACG AACCACGTGC GTGGCGGCTC AAAGTGACCG TGACCGAAGA 780 

GGGGGGACAG TACAAGATGT CGAAAGTTGA GTTCGTACCG TGACCGATGA CGTACGCGAC 840 

GTCAACACCG AAACCACTGA CGCCACCGAA GTCGCTGAGA TCGACTCAGC CGCAGGCGAA 900 

GCCGGTGATT CGGCGACCGA GGCATTTGAC ACCGACTCTG CAACGGAATC TACCGCGCAG 960 

AAGGGTCAGC GGCACCGTGA CCTGTGGCGA ATGCAGGTTA CCTTGAAACC CGTTCCGGTG 1020 

ATTCTCATCC TGCTCATGTT GATCTCTGGG GGCGCGACGG GATGGCTATA CCTTGAGCAA 1080 

TACGACCCGA TCAGCAGACG GACTCCGGCG CCGCCCGTGC TGCCGTCGCC GCGGCGTCTG 1140 

ACGGGACAAT CGCGCTGTTG TGTATTCACC CGACACGTCG ACCAAGACTT CGCTACCGCC 1200

AGGTCGCACC TCGCCGGCGA TTTCCTGTCC TATACGACCA GTTCACGCAG CAGATCGTGG 1260

CTCCGGCGGC CAAACAGAAG TCACTGAAAA CCACCGCCAA GGTGGTGCGC GCGGCCGTGT 1320

CGGAGCTACA TCCGGATTCG GCCGTCGTTC TGGTTTTTGT CGACCAGAGC ACTACCAGTA 1380

AGGACAGCCC CAATCCGTCG ATGGCGGCCA GCAGCGTGAT GGTGACCCTA GCCAAGGTCG 1440

ACGGCAATTG GCTGATCACC AAGTTCACCC CGGTTTAGGT TGCCGTAGGC GGTCGCCAAG 1500

TCTGACGGGG GCGCGGGTGG CTGCTCGTGC GAGATACCGG CCGTTCTCCG GACAATCACG 1560

GCCCGACCTC AAACAGATCT CGGCCGCTGT CTAATCGGCC GGGTTATTTA AGATTAGTTG 1620

CCACTGTATT TACCTGATGT TCAGATTGTT CAGCTGGATT TAGCTTCGCG GCAGGGCGGC 1680

TGGTGCACTT TGCATCTGGG GTTGTGACTA CTTGAGAGAA TTTGACCTGT TGCCGACGTT 1740

GTTTGCTGTC CATCATTGGT GCTAGTTATG GCCGAGCGGA AGGATTATCG AAGTGGTGGA 1800

CTTCGGGGCG TTACCACCGG AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC 1860

CTCGCTGGTG GCCGCCGCGA AGATGTGGGA CAGCGTGGCG AGTGACCTGT TTTCGGCCGC 1920

GTCGGCGTTT CAGTCGGTGG TCTGGGGTCT GACGACGGGA TCGTGGATAG GTTCGTCGGC 1980

GGGTCTGATG GTGGCGGCGG CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA 2040

GGCCGAGCTG ACCGCCGCCC AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG 2100

GCTGACGGTG CCCCCGCCGG TGATCGCCGA GAACCGTGCT GAACTGATGA TTCTGATAGC 2160

GACCAACCTC TTGGGGCAAA ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGGGA 2220

GATGTGGGCC CAAGACGCCG CCGCGATGTT TGGCTACGCC GCCACGGCGG CGACGGCGAC 2280

CGAGGCGTTG CTGCCGTTCG AGGACGCCCC ACTGATCACC AACCCCGGCG GGCTCCTTGA 2340

GCAGGCCGTC GCGGTCGAGG AGGCCATCGA CACCGCCGCG GCGAACCAGT TGATGAACAA 2400

TGTGCCCCAA GCGCTGCAAC AACTGGCCCA GCCCACGAAA AGCATCTGGC CGTTCGACCA 2460

ACTGAGTGAA CTCTGGAAAG CCATCTCGCC GCATCTGTCG CCGCTCAGCA ACATCGTGTC 2520

GATGCTCAAC AACCACGTGT CGATGACCAA CTCGGGTGTG TCGATGGCCA GCACCTTGCA 2580

CTCAATGTTG AAGGGCTTTG CTCCGGCGGC GGCTCAGGCC GTGGAAACCG CGGCGCAAAA 2640

CGGGGTCCAG GCGATGAGCT CGCTGGGCAG CCAGCTGGGT TCGTCGCTGG GTTCTTCGGG 2700

TCTGGGCGCT GGGGTGGCCG CCAACTTGGG TCGGGCGGCC TCGGTCGGTT CGTTGTCGGT 2760

GCCGCAGGCC TGGGCCGCGG CCAACCAGGC GGTCACCCCG GCGGCGCGGG CGCTGCCGCT 2820

GACCAGCCTG ACCAGCGCCG CCCAAACCGC CCCCGGACAC ATGCTGGGCG GGCTACCGCT 2880 

GGGGCAACTG ACCAATAGCG GCGGCGGGTT CGGCGGGGTT AGCAATGCGT TGCGGATGCC 2940

GCCGCGGGCG TACGTAATGC CCCGTGTGCC CGCCGCCGGG TAACGCCGAT CCGCACGCAA 3000

TGCGGGCCCT CTATGCGGGC AGCGATC 3027
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:

Val Val Asp Phe Gly Ala Leu Pro Pro Glu Ile Asn Ser Ala Arg Met
1 5 10 15

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Lys Met Trp
20 25 30

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gln Ser
35 40 45

Val Val Trp Gly Leu Thr Thr Gly Ser Trp Ile Gly Ser Ser Ala Gly
50 55 60

Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr
65 70 75 80

Ala Gly Gln Ala Glu Leu Thr Ala Ala Gln Val Arg Val Ala Ala Ala
85 90 95

Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val Ile Ala
100 105 110

Glu Asn Arg Ala Glu Leu Met Ile Leu Ile Ala Thr Asn Leu Leu Gly
115 120 125

Gln Asn Thr Pro Ala Ile Ala Val Asn Glu Ala Glu Tyr Gly Glu Met
130 135 140

Trp Ala Gln Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Thr Ala Ala
145 150 155 160

Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu Ile Thr
165 170 175

Asn Pro Gly Gly Leu Leu Glu Gln Ala Val Ala Val Glu Glu Ala Ile
180 185 190

Asp Thr Ala Ala Ala Asn Gln Leu Met Asn Asn Val Pro Gln Ala Leu

GCTACAAACC GCGTCAACAC AGGCCAAGAC CCGTGCGATG CAGGCGACGG CGCAAGCCGC 300

GGCATACACC CAGGCCATGG CCACGACGCC GTCGCTGCCG GAGATCGCCG CCAACCACAT 360 

CACCCAGGCC GTCCTTACGG CCACCAACTT CTTCGGTATC AACACGATCC CGATCGCGTT 420

GACCGAGATG GATTATTTCA TCCGTATGTG GAACCAGGCA GCCCTGGCAA TGGAGGTCTA 480 

CCAGGCCGAG ACCGCGGTTA ACACGCTTTT CGAGAAGCTC GAGCCGATGG CGTCGATCCT 540

TGATCCCGGC GCGAGCCAGA GCACGACGAA CCCGATCTTC GGAATGCCCT CCCCTGGCAG 600 

CTCAACACCG GTTGGCCAGT TGCCGCCGGC GGCTACCCAG ACCCTCGGCC AACTGGGTGA 660

GATGAGCGGC CCGATGCAGC AGCTGACCCA GCCGCTGCAG CAGGTGACGT CGTTGTTCAG 720

CCAGGTGGGC GGCACCGGCG GCGGCAACCC AGCCGACGAG GAAGCCGCGC AGATGGGCCT 780 

GCTCGGCACC AGTCCGCTGT CGAACCATCC GCTGGCTGGT GGATCAGGCC CCAGCGCGGG 840 

CGCGGGCCTG CTGCGCGCGG AGTCGCTACC TGGCGCAGGT GGGTCGTTGA CCCGCACGCC 900 

GCTGATGTCT CAGCTGATCG AAAAGCCGGT TGCCCCCTCG GTGATGCCGG CGGCTGCTGC 960 

CGGATCGTCG GCGACGGGTG GCGCCGCTCC GGTGGGTGCG GGAGCGATGG GCCAGGGTGC 1020 

GCAATCCGGC GGCTCCACCA GGCCGGGTCT GGTCGCGCCG GCACCGCTCG CGCAGGAGCG 1080 

TGAAGAAGAC GACGAGGACG ACTGGGACGA AGAGGACGAC TGGTGAGCTC CCGTAATGAC 1140 

AACAGACTTC CCGGCCACCC GGGCCGGAAG ACTTGCCAAC ATTTTGGCGA GGAAGGTAAA 1200 

GAGAGAAAGT AGTCCAGCAT GGCAGAGATG AAGACCGATG CCGCTACCCT CGCGCAGGAG 1260 

GCAGGTAATT TCGAGCGGAT CTCCGGCGAC CTGAAAACCC AGATCGACCA GGTGGAGTCG 1320 

ACGGCAGGTT CGTTGCAGGG CCAGTGGCGC GGCGCGGCGG GGACGGCCGC CCAGGCCGCG 1380

GTGGTGCGCT TCCAAGAAGC AGCCAATAAG CAGAAGCAGG AACTCGACGA GATCTCGACG 1440 

AATATTCGTC AGGCCGGCGT CCAATACTCG AGGGCCGACG AGGAGCAGCA GCAGGCGCTG 1500 

TCCTCGCAAA TGGGCTTCTG ACCCGCTAAT ACGAAAAGAA ACGGAGCAAA AACATGACAG 1560 

AGCAGCAGTG GAATTTCGCG GGTATCGAGG CCGCGGCAAG CGCAATCCAG GGAAAT 1616 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

CTAGTGGATG GGACCATGGC CATTTTCTGC AGTCTCACTG CCTTCTGTGT TGACATTTTG 60 

GCACGCCGGC GGAAACGAAG CACTGGGGTC GAAGAACGGC TGCGCTGCCA TATCGTCCGG 120 

AGCTTCCATA CCTTCGTGCG GCCGGAAGAG CTTGTCGTAG TCGGCCGCCA TGACAACCTC 180 

TCAGAGTGCG CTCAAACGTA TAAACACGAG AAAGGGCGAG ACCGACGGAA GGTCGAACTC 240

GCCCGATCCC GTGTTTCGCT ATTCTACGCG AACTCGGCGT TGCCCTATGC GAACATCCCA 300 

GTGACGTTGC CTTCGGTCGA AGCCATTGCC TGACCGGCTT CGCTGATCGT CCGCGCCAGG 360 

TTCTGCAGCG CGTTGTTCAG CTCGGTAGCC GTGGCGTCCC ATTTTTGCTG GACACCCTGG 420 

TACGCCTCCG AA 432

(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 368 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

Met Leu Trp His Ala Met Pro Pro Glu Xaa Asn Thr Ala Arg Leu Met
1 5 10 15

Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala Gly Trp Gln
20 25 30

Thr Leu Ser Ala Ala Leu Asp Ala Gln Ala Val Glu Leu Thr Ala Arg
35 40 45

Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser Asp Lys Ala
50 55 60

Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gln Thr Ala Ser Thr
65 70 75 80

Gln Ala Lys Thr Arg Ala Met Gln Ala Thr Ala Gln Ala Ala Ala Tyr
85 90 95

Thr Gln Ala Met Ala Thr Thr Pro Ser Leu Pro Glu Ile Ala Ala Asn
100 105 110

His Ile Thr Gln Ala Val Leu Thr Ala Thr Asn Phe Phe Gly Ile Asn
115 120 125

Thr Ile Pro Ile Ala Leu Thr Glu Met Asp Tyr Phe Ile Arg Met Trp
130 135 140

Asn Gln Ala Ala Leu Ala Met Glu Val Tyr Gln Ala Glu Thr Ala Val
145 150 155 160

Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser Ile Leu Asp Pro
165 170 175

Gly Ala Ser Gln Ser Thr Thr Asn Pro Ile Phe Gly Met Pro Ser Pro
180 185 190

Gly Ser Ser Thr Pro Val Gly Gln Leu Pro Pro Ala Ala Thr Gln Thr
195 200 205

Leu Gly Gln Leu Gly Glu Met Ser Gly Pro Met Gln Gln Leu Thr Gln
210 215 220 

Pro Leu Gln Gln Val Thr Ser Leu Phe Ser Gln Val Gly Gly Thr Gly
225 230 235 240

Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gln Met Gly Leu Leu Gly
245 250 255

Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser Gly Pro Ser
260 265 270

Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly Ala Gly Gly
275 280 285

Ser Leu Thr Arg Thr Pro Leu Met Ser Gln Leu Ile Glu Lys Pro Val
290 295 300

Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser Ala Thr Gly
305 310 315 320

Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gln Gly Ala Gln Ser
325 330 335

Gly Gly Ser Thr Arg Pro Gly Leu Val Ala Pro Ala Pro Leu Ala Gln
340 345 350 

Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu Asp Asp Trp 

355 360 365 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 



Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Ala Gln Glu Ala Gly
1 5 10 15

Asn Phe Glu Arg Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val
20 25 30

Glu Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly
35 40 45

Thr Ala Ala Gln Ala Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys
50 55 60

Gln Lys Gln Glu Leu Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly
65 70 75 80

Val Gln Tyr Ser Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser
85 90 95

Gln Met Gly Phe
100

(2) INFORMATION FOR SEQ ID NO: 116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 396 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

GATCTCCGGC GACCTGAAAA CCCAGATCGA CCAGGTGGAG TCGACGGCAG GTTCGTTGCA 60

GGGCCAGTGG CGCGGCGCGG CGGGGACGGC CGCCCAGGCC GCGGTGGTGC GCTTCCAAGA 120

AGCAGCCAAT AAGCAGAAGC AGGAACTCGA CGAGATCTCG ACGAATATTC GTCAGGCCGG 180 

CGTCCAATAC TCGAGGGCCG ACGAGGAGCA GCAGCAGGCG CTGTCCTCGC AAATGGGCTT 240

CTGACCCGCT AATACGAAAA GAAACGGAGC AAAAACATGA CAGAGCAGCA GTGGAATTTC 300 

GCGGGTATCG AGGCCGCGGC AAGCGCAATC CAGGGAAATG TCACGTCCAT TCATTCCCTC 360 

CTTGACGAGG GGAAGCAGTC CCTGACCAAG CTCGCA 396 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 



Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala
1 5 10 15

Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln
20 25 30

Ala Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu
35 40 45

Leu Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser
50 55 60

Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
65 70 75 80



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 
GTGGATCCCG ATCCCGTGTT TCGCTATTCT ACGCGAACTC GGCGTTGCCC TATGCGAACA 
TCCCAGTGAC GTTGCCTTCG GTCGAAGCCA TTGCCTGACC GGCTTCGCTG ATCGTCCGCG 
CCAGGTTCTG CAGCGCGTTG TTCAGCTCGG TAGCCGTGGC GTCCCATTTT TGCTGGACAC 
CCTGGTACGC CTCCGAACCG CTACCGCCCC AGGCCGCTGC GAGCTTGGTC AGGGACTGCT 
TCCCCTCGTC AAGGAGGGAA TGAATGGACG TGACATTTCC CTGGATTGCG CTTGCCGCGG 
CCTCGATACC CGCGAAATTC CACTGCTGCT CTGTCATGTT TTTGCTCCGT TTCTTTTCGT 
ATTAGCGGGT CAGAAGCCCA TTTGCGA 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:119: 
CGGCACGAGG ATCTCGGTTG GCCCAACGGC GCTGGCGAGG GCTCCGTTCC GGGGGCGAGC 
TGCGCGCCGG ATGCTTCCTC TGCCCGCAGC CGCGCCTGGA TGGATGGACC AGTTGCTACC 
TTCCCGACGT TTCGTTCGGT GTCTGTGCGA TAGCGGTGAC CCCGGCGCGC ACGTCGGGAG 
TGTTGGGGGG CAGGCCGGGT CGGTGGTTCG GCCGGGGACG CAGACGGTCT GGACGGAACG
OGCGGGGGTT CGCCGATTGG CATCTTTGCC CA 272
(2) INFORMATION FOR SEQ ID NO: 120: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Asp Pro Val Asp Ala Val Ile Asn Thr Thr Cys Asn Tyr Gly Gln Val
1 5 10 15

Val Ala Ala Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

Ala Val Glu Ser Gly Met Leu Ala Leu Gly Thr Pro Ala Pro Ser
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 122: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala Ala Lys 
1 5 10 15



Glu Gly Arg 



(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

Tyr Tyr Trp Cys Pro Gly Gln Pro Phe Asp Pro Ala Trp Gly Pro
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:

Asp Ile Gly Ser Glu Ser Thr Glu Asp Gln Gln Xaa Ala Val
1 5 10

(2) INFORMATION FOR SEQ ID NO: 125: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

Ala Glu Glu Ser Ile Ser Thr Xaa Glu Xaa Ile Val Pro
1 5 10

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

Asp Pro Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro 
1 5 10 15

Ser 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

Ala Pro Lys Thr Tyr Xaa Glu Glu Leu Lys Gly Thr Asp Thr Gly 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Asp Pro Ala Ser Ala Pro Asp Val Pro Thr Ala Ala Gln Leu Thr Ser
1 5 10 15

Leu Leu Asn Ser Leu Ala Asp Pro Asn Val Ser Phe Ala Asn
20 25 30

(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 

Asp Pro Pro Asp Pro His Gln Xaa Asp Met Thr Lys Gly Tyr Tyr Pro
1 5 10 15 

Gly Gly Arg Arg Xaa Phe 
20 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Asp Pro Gly Tyr Thr Pro Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 131: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: /note= "The Second Residue Can Be Either 
Pro or Thr" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Xaa Xaa Gly Phe Thr Gly Pro Gln Phe Tyr
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: /note= "The Third Residue Can Be Either
Gln or Leu"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132: 

Xaa Pro Xaa Val Thr Ala Tyr Ala Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

Xaa Xaa Xaa Glu Lys Pro Phe Leu Arg 
1 5 

(2) INFORMATION FOR SEQ ID NO:134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

Xaa Asp Ser Glu Lys Ser Ala Thr Ile Lys Val Thr Asp Ala Ser
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135: 

Ala Gly Asp Thr Xaa Ile Tyr Ile Val Gly Asn Leu Thr Ala Asp
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Ala Pro Glu Ser Gly Ala Gly Leu Gly Gly Thr Val Gln Ala Gly
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Xaa Tyr Ile Ala Tyr Xaa Thr Thr Ala Gly Ile Val Pro Gly Lys Ile
1 5 10 15

Asn Val His Leu Val
20

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

GCAACGCTGT CGTGGCCTTT GCGGTGATCG GTTTCGCCTC GCTGGCGGTG GCGGTGGCGG 60

TCACCATCCG ACCGACCGCG GCCTCAAAAC CGGTAGAGGG ACACCAAAAC GCCCAGCCAG 120 

GGAAGTTCAT GCCGTTGTTG CCGACGCAAC AGCAGGCGCC GGTCCCGCCG CCTCCGCCCG 180 

ATGATCCCAC CGCTGGATTC CAGGGCGGCA CCATTCCGGC TGTACAGAAC GTGGTGCCGC 240 

GGCCGGGTAC CTCACCCGGG GTGGGTGGGA CGCCGGCTTC GCCTGCGCCG GAAGCGCCGG 300 

CCGTGCCCGG TGTTGTGCCT GCCCCGGTGC CAATCCCGGT CCCGATCATC ATTCCCCCGT 360 

TCCCGGGTTG GCAGCCTGGA ATGCCGACCA TCCCCACCGC ACCGCCGACG ACGCCGGTGA 420

CCACGTCGGC GACGACGCCG CCGACCACGC CGCCGACCAC GCCGGTGACC ACGCCGCCAA 480 

CGACGCCGCC GACCACGCCG GTGACCACGC CGCCAACGAC GCCGCCGACC ACGCCGGTGA 540 

CCACGCCACC AACGACCGTC GCCCCGACGA CCGTCGCCCC GACGACGGTC GCTCCGACCA 600 

CCGTCGCCCC GACCACGGTC GCTCCAGCCA CCGCCACGCC GACGACCGTC GCTCCGCAGC 660

CGACGCAGCA GCCCACGCAA CAACCAACCC AACAGATGCC AACCCAGCAG CAGACCGTGG 720

CCCCGCAGAC GGTGGCGCCG GCTCCGCAGC CGCCGTCCGG TGGCCGCAAC GGCAGCGGCG 780

GGGGCGACTT ATTCGGCGGG TTCTGATCAC GGTCGCGGCT TCACTACGGT CGGAGGACAT 840

GGCCGGTGAT GCGGTGACGG TGGTGCTGCC CTGTCTCAAC GA 882

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

CCATCAACCA ACCGCTCGCG CCGCCCGCGC CGCCGGATCC GCCGTCGCCG CCACGCCCGC 60 

CGGTGCCTCC GGTGCCCCCG TTGCCGCCGT CGCCGCCGTC GCCGCCGACC GGCTGGGTGC 120 

CTAGGGCGCT GTTACCGCCC TGGTTGGCGG GGACGCCGCC GGCACCACCG GTACCGCCGA 180 

TGGCGCCGTT GCCGCCGGCG GCACCGTTGC CACCGTTGCC ACCGTTGCCA CCGTTGCCGA 240

CCAGCCACCC GCCGCGACCA CCGGCACCGC CGGCGCCGCC CGCACCGCCG GCGTGCCCGT 300 

TCGTGCCCGT ACCGCCGGCA CCGCCGTTGC CGCCGTCACC GCCGACGGAA CTACCGGCGG 360 

ACGCGGCCTG CCCGCCGGCG CCGCCCGCAC CGCCATTGGC ACCGCCGTCA CCGCCGGCTG 420

GGAGTGCCGC GATTAGGGCA CTGACCGGCG CAACCAGCGC AAGTACTCTC GGTCACCGAG 480

CACTTCCAGA CGACACCACA GCACGGGGTT GTCGGCGGAC TGGGTGAAAT GGCAGCCGAT 540 

AGCGGCTAGC TGTCGGCTGC GGTCAACCTC GATCATGATG TCGAGGTGAC CGTGACCGCG 600 

CCCCCCGAAG GAGGCGCTGA ACTCGGCGTT GAGCCGATCG GCGATCGGTT GGGGCAGTGC 660 

CCAGGCCAAT ACGGGGATAC CGGGTGTCNA AGCCGCCGCG AGCGCAGCTT CGGTTGCGCG 720 

ACNGTGGTCG GGGTGGCCTG TTACGCCGTT GTCNTCGAAC ACGAGTAGCA GGTCTGCTCC 780 

GGCGAGGGCA TCCACCACGC GTTGCGTCAG CTCGT 815 

(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1152 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

ACCAGCCGCC GGCTGAGGTC TCAGATCAGA GAGTCTCCGG ACTCACCGGG GCGGTTCAGC 60 

CTTCTCCCAG AACAACTGCT GAAGATCCTC GCCCGCGAAA CAGGCGCTGA TTTGACGCTC 120

TATGACCGGT TGAACGACGA GATCATCCGG CAGATTGATA TGGCACCGCT GGGCTAACAG 180

GTGCGCAAGA TGGTGCAGCT GTATGTCTCG GACTCCGTGT CGCGGATCAG CTTTGCCGAC 240 

GGCCGGGTGA TCGTGTGGAG CGAGGAGCTC GGCGAGAGCC AGTATCCGAT CGAGACGCTG 300 

GACGGCATCA CGCTGTTTGG GCGGCCGACG ATGACAACGC CCTTCATCGT TGAGATGCTC 360 

AAGCGTGAGC GCGACATCCA GCTCTTCACG ACCGACGGCC ACTACCAGGG CCGGATCTCA 420 

ACACCCGACG TGTCATACGC GCCGCGGCTC CGTCAGCAAG TTCACCGCAC CGACGATCCT 480

GCGTTCTGCC TGTCGTTAAG CAAGCGGATC GTGTCGAGGA AGATCCTGAA TCAGCAGGCC 540

TTGATTCGGG CACACACGTC GGGGCAAGAC GTTGCTGAGA GCATCCGCAC GATGAAGCAC 600 

TCGCTGGCCT GGGTCGATCG ATCGGGCTCC CTGGCGGAGT TGAACGGGTT CGAGGGAAAT 660 

GCCGCAAAGG CATACTTCAC CGCGCTGGGG CATCTCGTCC CGCAGGAGTT CGCATTCCAG 720 

GGCCGCTCGA CTCGGCCGCC GTTGGACGCC TTCAACTCGA TGGTCAGCCT CGGCTATTCG 780 

CTGCTGTACA AGAACATCAT AGGGGCGATC GAGCGTCACA GCCTGAACGC GTATATCGGT 840

TTCCTACACC AGGATTCACG AGGGCACGCA ACGTCTCGTG CCGAATTCGG CACGAGCTCC

GCTGAAACCG CTGGCCGGCT GCTCAGTGCC CGTACGTAAT CCGCTGCGCC CAGGCCGGCC 

CGCCGGCCGA ATACCAGCAG ATCGGACAGC GAATTGCCGC CCAGCCGGTT GGAGCCGTGC 

ATACCGCCGG CACACTCACC GGCAGCGAAC AGGCCTGGCA CCGTGGCGGC GCCGGTGTCC 

GCGTCTACTT CGACACCGCC CATCACGTAG TGACACGTCG GCCCGACTTC CATTGCCTGC 
GTTCGGCACG AG 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:
CTCGTGCCGA TTCGGCAGGG TGTACTTGCC GGTGGTGTAN GCCGCATGAG TGCCGACGAC 
CAGCAATGCG GCAACAGCAC GGATCCCGGT CAACGACGCC ACCCGGTCCA CGTGGGCGAT 
CCGCTCGAGT CCGCCCTGGG CGGCTCTTTC CTTGGGCAGG GTCATCCGAC GTGTTTCCGC 
CGTGGTTTGC CGCCATTATG CCGGCGCGCC GCGTCGGGCG GCCGGTATGG CCGAANGTCG 
ATCAGCACAC CCGAGATACG GGTCTGTGCA AGCTTTTTGA GCGTCGCGCG GGGCAGCTTC 
GCCGGCAATT CTACTAGCGA GAAGTCTGGC CCGATACGGA TCTGACCGAA GTCGCTGCGG 
TGCAGCCCAC CCTCATTGGC GATGGCGCCG ACGATGGCGC CTGGACCGAT CTTGTGCCGC 
TTGCCGACGG CGACGCGGTA GGTGGTCAAG TCCGGTCTAC GCTTGGGCCT TTGCGGACGG 
TCCCGACGCT GGTCGCGGTT GCGCCGCGAA AGCGGCGGGT CGGGTGCCAT CAGGAATGCC 
TCACCGCCGC GGCACTGCAC GGCCAGTGCC GCGGCGATGT CAGCCATCGG GACATCATGC 
TCGCGTTCAT ACTCCTCGAC CAGTCGGCGG AACAGCTCGA TTCCCGGACC GCCCA 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:

Asn Ala Val Val Ala Phe Ala Val Ile Gly Phe Ala Ser Leu Ala Val
1 5 10 15

Ala Val Ala Val Thr Ile Arg Pro Thr Ala Ala Ser Lys Pro Val Glu
20 25 30

Gly His Gln Asn Ala Gln Pro Gly Lys Phe Met Pro Leu Leu Pro Thr
35 40 45

Gln Gln Gln Ala Pro Val Pro Pro Pro Pro Pro Asp Asp Pro Thr Ala
50 55 60

Gly Phe Gln Gly Gly Thr Ile Pro Ala Val Gln Asn Val Val Pro Arg
65 70 75 80

Pro Gly Thr Ser Pro Gly Val Gly Gly Thr Pro Ala Ser Pro Ala Pro
85 90 95

Glu Ala Pro Ala Val Pro Gly Val Val Pro Ala Pro Val Pro Ile Pro
100 105 110

Val Pro Ile Ile Ile Pro Pro Phe Pro Gly Trp Gln Pro Gly Met Pro
115 120 125

Thr Ile Pro Thr Ala Pro Pro Thr Thr Pro Val Thr Thr Ser Ala Thr
130 135 140



Thr Pro Pro Thr Thr Pro Pro Thr Thr Pro Val Thr Thr Pro Pro Thr 
145 150 155 160 

Thr Pro Pro Thr Thr Pro Val Thr Thr Pro Pro Thr Thr Pro Pro Thr 
165 170 175 

Thr Pro Val Thr Thr Pro Pro Thr Thr Val Ala Pro Thr Thr Val Ala 
180 185 190 

Pro Thr Thr Val Ala Pro Thr Thr Val Ala Pro Thr Thr Val Ala Pro 
195 200 205 

Ala Thr Ala Thr Pro Thr Thr Val Ala Pro Gln Pro Thr Gln Gln Pro
210 215 220

Thr Gln Gln Pro Thr Gln Gln Met Pro Thr Gln Gln Gln Thr Val Ala
225 230 235 240

Pro Gln Thr Val Ala Pro Ala Pro Gln Pro Pro Ser Gly Gly Arg Asn
245 250 255 

Gly Ser Gly Gly Gly Asp Leu Phe Gly Gly Phe 
260 265 



(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 



Ile Asn Gln Pro Leu Ala Pro Pro Ala Pro Pro Asp Pro Pro Ser Pro
1 5 10 15

Pro Arg Pro Pro Val Pro Pro Val Pro Pro Leu Pro Pro Ser Pro Pro
20 25 30

Ser Pro Pro Thr Gly Trp Val Pro Arg Ala Leu Leu Pro Pro Trp Leu
35 40 45

Ala Gly Thr Pro Pro Ala Pro Pro Val Pro Pro Met Ala Pro Leu Pro
50 55 60

Pro Ala Ala Pro Leu Pro Pro Leu Pro Pro Leu Pro Pro Leu Pro Thr
65 70 75 80

Ser His Pro Pro Arg Pro Pro Ala Pro Pro Ala Pro Pro Ala Pro Pro
85 90 95

Ala Cys Pro Phe Val Pro Val Pro Pro Ala Pro Pro Leu Pro Pro Ser
100 105 110

Pro Pro Thr Glu Leu Pro Ala Asp Ala Ala Cys Pro Pro Ala Pro Pro 
115 120 125 



Ala Pro Pro Leu Ala Pro Pro Ser Pro Pro Ala Gly Ser Ala Ala Ile
130 135 140

Arg Ala Leu Thr Gly Ala Thr Ser Ala Ser Thr Leu Gly His Arg Ala
145 150 155 160

Leu Pro Asp Asp Thr Thr Ala Arg Gly Cys Arg Arg Thr Gly
165 170

(2) INFORMATION FOR SEQ ID NO: 144:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

Gln Pro Pro Ala Glu Val Ser Asp Gln Arg Val Ser Gly Leu Thr Gly
1 5 10 15

Ala Val Gln Pro Ser Pro Arg Thr Thr Ala Glu Asp Pro Arg Pro Arg
20 25 30

Asn Arg Arg
35

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

Arg Ala Asp Ser Ala Gly Cys Thr Cys Arg Trp Cys Xaa Pro His Glu 
1 5 10 15

Cys Arg Arg Pro Ala Met Arg Gln Gln His Gly Ser Arg Ser Thr Thr
20 25 30 

Pro Pro Gly Pro Arg Gly Arg Ser Ala Arg Val Arg Pro Gly Arg Leu 
35 40 45 

Phe Pro Trp Ala Gly Ser Ser Asp Val Phe Pro Pro Trp Phe Ala Ala 
50 55 60 

Ile Met Pro Ala Arg Arg Val Gly Arg Pro Val Trp Pro Xaa Val Asp
65 70 75 80

Gln His Thr Arg Asp Thr Gly Leu Cys Lys Leu Phe Glu Arg Arg Ala
85 90 95

Gly Gln Leu Arg Arg Gln Phe Tyr
100 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



(vi) ORIGINAL SOURCE:

(A) ORGANISM: Mycobacterium tuberculosis
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 
GGATCCATAT GGGCCATCAT CATCATCATC ACGTGATCGA CATCATCGGG ACC 
(2) INFORMATION FOR SEQ ID NO: 147:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR Primer" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

CCTGAATTCA GGCCTCGGTT GCGCCGGCCT CATCTTGAAC GA 



(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR Primer" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

GGATCCTGCA GGCTCGAAAC CACCGAGCGG T 



(2) INFORMATION FOR SEQ ID NO: 149: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:
CTCTGAATTC AGCGCTGGAA ATCGTCGCGA T 
(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

GGATCCAGCG CTGAGATGAA GACCGATGCC GCT 

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "PCR primer" 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

GAGAGAATTC TCAGAAGCCC ATTTGCGAGG ACA 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 



(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 152..1273

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCGA 60 

AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 120 

GCGGAAATTG AAGAGCACAG AAAGGTATGG C GTG AAA ATT CGT TTG CAT ACG 172 

Val Lys Ile Arg Leu His Thr 
1 5 

CTG TTG GCC GTG TTG ACC GCT GCG CCG CTG CTG CTA GCA GCG GCG GGC 220 
Leu Leu Ala Val Leu Thr Ala Ala Pro Leu Leu Leu Ala Ala Ala Gly 
10 15 20 

TGT GGC TCG AAA CCA CCG AGC GGT TCG CCT GAA ACG GGC GCC GGC GCC 268 
Cys Gly Ser Lys Pro Pro Ser Gly Ser Pro Glu Thr Gly Ala Gly Ala 
25 30 35 

GGT ACT GTC GCG ACT ACC CCC GCG TCG TCG CCG GTG ACG TTG GCG GAG 316 
Gly Thr Val Ala Thr Thr Pro Ala Ser Ser Pro Val Thr Leu Ala Glu 
40 45 50 55 

ACC GGT AGC ACG CTG CTC TAC CCG CTG TTC AAC CTG TGG GGT CCG GCC 364 
Thr Gly Ser Thr Leu Leu Tyr Pro Leu Phe Asn Leu Trp Gly Pro Ala 
60 65 70 

TTT CAC GAG AGG TAT CCG AAC GTC ACG ATC ACC GCT CAG GGC ACC GGT 412 
Phe His Glu Arg Tyr Pro Asn Val Thr Ile Thr Ala Gln Gly Thr Gly 
75 80 85 

TCT GGT GCC GGG ATC GCG CAG GCC GCC GCC GGG ACG GTC AAC ATT GGG 460 
Ser Gly Ala Gly Ile Ala Gln Ala Ala Ala Gly Thr Val Asn Ile Gly 
90 95 100 

GCC TCC GAC GCC TAT CTG TCG GAA GGT GAT ATG GCC GCG CAC AAG GGG 508 
Ala Ser Asp Ala Tyr Leu Ser Glu Gly Asp Met Ala Ala His Lys Gly 
105 110 115 

CTG ATG AAC ATC GCG CTA GCC ATC TCC GCT CAG CAG GTC AAC TAC AAC 556 
Leu Met Asn Ile Ala Leu Ala Ile Ser Ala Gln Gln Val Asn Tyr Asn 
120 125 130 135 

CTG CCC GGA GTG AGC GAG CAC CTC AAG CTG AAC GGA AAA GTC CTG GCG 604 
Leu Pro Gly Val Ser Glu His Leu Lys Leu Asn Gly Lys Val Leu Ala 
140 145 150 

GCC ATG TAC CAG GGC ACC ATC AAA ACC TGG GAC GAC CCG CAG ATC GCT 652 
Ala Met Tyr Gln Gly Thr Ile Lys Thr Trp Asp Asp Pro Gln Ile Ala 
155 160 165 

GCG CTC AAC CCC GGC GTG AAC CTG CCC GGC ACC GCG GTA GTT CCG CTG 700 
Ala Leu Asn Pro Gly Val Asn Leu Pro Gly Thr Ala Val Val Pro Leu 
170 175 180 

CAC CGC TCC GAC GGG TCC GGT GAC ACC TTC TTG TTC ACC CAG TAC CTG 748 
His Arg Ser Asp Gly Ser Gly Asp Thr Phe Leu Phe Thr Gln Tyr Leu 
185 190 195 

TCC AAG CAA GAT CCC GAG GGC TGG GGC AAG TCG CCC GGC TTC GGC ACC 796 
Ser Lys Gln Asp Pro Glu Gly Trp Gly Lys Ser Pro Gly Phe Gly Thr 
200 205 210 215 

ACC GTC GAC TTC CCG GCG GTG CCG GGT GCG CTG GGT GAG AAC GGC AAC 844 
Thr Val Asp Phe Pro Ala Val Pro Gly Ala Leu Gly Glu Asn Gly Asn 
220 225 230 

GGC GGC ATG GTG ACC GGT TGC GCC GAG ACA CCG GGC TGC GTG GCC TAT 892 
Gly Gly Met Val Thr Gly Cys Ala Glu Thr Pro Gly Cys Val Ala Tyr 
235 240 245 

ATC GGC ATC AGC TTC CTC GAC CAG GCC AGT CAA CGG GGA CTC GGC GAG 940 
Ile Gly Ile Ser Phe Leu Asp Gln Ala Ser Gln Arg Gly Leu Gly Glu 
250 255 260 



GCC CAA CTA GGC AAT AGC TCT GGC AAT TTC TTG TTG CCC GAC GCG CAA 988 
Ala Gln Leu Gly Asn Ser Ser Gly Asn Phe Leu Leu Pro Asp Ala Gln 
265 270 275 

AGC ATT CAG GCC GCG GCG GCT GGC TTC GCA TCG AAA ACC CCG GCG AAC 1036 
Ser Ile Gln Ala Ala Ala Ala Gly Phe Ala Ser Lys Thr Pro Ala Asn 
280 285 290 295 

CAG GCG ATT TCG ATG ATC GAC GGG CCC GCC CCG GAC GGC TAC CCG ATC 1084 
Gln Ala Ile Ser Met Ile Asp Gly Pro Ala Pro Asp Gly Tyr Pro Ile 
300 305 310 

ATC AAC TAC GAG TAC GCC ATC GTC AAC AAC CGG CAA AAG GAC GCC GCC 1132 
Ile Asn Tyr Glu Tyr Ala Ile Val Asn Asn Arg Gln Lys Asp Ala Ala 
315 320 325 

ACC GCG CAG ACC TTG CAG GCA TTT CTG CAC TGG GCG ATC ACC GAC GGC 1180 
Thr Ala Gln Thr Leu Gln Ala Phe Leu His Trp Ala Ile Thr Asp Gly 
330 335 340 

AAC AAG GCC TCG TTC CTC GAC CAG GTT CAT TTC CAG CCG CTG CCG CCC 1228 
Asn Lys Ala Ser Phe Leu Asp Gln Val His Phe Gln Pro Leu Pro Pro 
345 350 355 

GCG GTG GTG AAG TTG TCT GAC GCG TTG ATC GCG ACG ATT TCC AGC 1273 
Ala Val Val Lys Leu Ser Asp Ala Leu Ile Ala Thr Ile Ser Ser 
360 365 370 

TAGCCTCGTT GACCACCACG CGACAGCAAC CTCCGTCGGG CCATCGGGCT GCTTTGCGGA 1333 

GCATGCTGGC CCGTGCCGGT GAAGTCGGCC GCGCTGGCCC GGCCATCCGG TGGTTGGGTG 1393 

GGATAGGTGC GGTGATCCCG CTGCTTGCGC TGGTCTTGGT GCTGGTGGTG CTGGTCATCG 1453 

AGGCGATGGG TGCGATCAGG CTCAACGGGT TGCATTTCTT CACCGCCACC GAATGGAATC 1513 

CAGGCAACAC CTACGGCGAA ACCGTTGTCA CCGACGCGTC GCCCATCCGG TCGGCGCCTA 1573 

CTACGGGGCG TTGCCGCTGA TCGTCGGGAC GCTGGCGACC TCGGCAATCG CCCTGATCAT 1633 

CGCGGTGCCG GTCTCTGTAG GAGCGGCGCT GGTGATCGTG GAACGGCTGC CGAAACGGTT 1693 

GGCCGAGGCT GTGGGAATAG TCCTGGAATT GCTCGCCGGA ATCCCCAGCG TGGTCGTCGG 1753 

TTTGTGGGGG GCAATGACGT TCGGGCCGTT CATCGCTCAT CACATCGCTC CGGTGATCGC 1813 

TCACAACGCT CCCGATGTGC CGGTGCTGAA CTACTTGCGC GGCGACCCGG GCAACGGGGA 1873 

GGGCATGTTG GTGTCCGGTC TGGTGTTGGC GGTGATGGTC GTTCCCATTA TCGCCACCAC 1933 

CACTCATGAC CTGTTCCGGC AGGTGCCGGT GTTGCCCCGG GAGGGCGCGA TCGGGAATTC 1993 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

Val Lys Ile Arg Leu His Thr Leu Leu Ala Val Leu Thr Ala Ala Pro 
1 5 10 15 

Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Gly Ser 
20 25 30 

Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser 
35 40 45 

Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu 
50 55 60 

Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr 
65 70 75 80 

Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln Ala Ala 
85 90 95 

Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly 
100 105 110 

Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala Ile Ser 
115 120 125 

Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys 
130 135 140 

Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile Lys Thr 
145 150 155 160 

Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn Leu Pro 
165 170 175 

Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 
180 185 190 

Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly Trp Gly 
195 200 205 

Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly 
210 215 220 

Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu 
225 230 235 240 

Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp Gln Ala 
245 250 255 

Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser Gly Asn 
260 265 270 

Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala Gly Phe 
275 280 285 

Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp Gly Pro 
290 295 300 

Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile Val Asn 
305 310 315 320 

Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala Phe Leu 
325 330 335 

His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gln Val 
340 345 350 

His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu 
355 360 365 

Ile Ala Thr Ile Ser Ser 
370 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1993 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCGA 60 

AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGC 120 

GCGGAAATTG AAGAGCACAG AAAGGTATGG CGTGAAAATT CGTTTGCATA CGCTGTTGGC 180 

CGTGTTGACC GCTGCGCCGC TGCTGCTAGC AGCGGCGGGC TGTGGCTCGA AACCACCGAG 240 

CGGTTCGCCT GAAACGGGCG CCGGCGCCGG TACTGTCGCG ACTACCCCCG CGTCGTCGCC 300 

GGTGACGTTG GCGGAGACCG GTAGCACGCT GCTCTACCCG CTGTTCAACC TGTGGGGTCC 360 

GGCCTTTCAC GAGAGGTATC CGAACGTCAC GATCACCGCT CAGGGCACCG GTTCTGGTGC 420 

CGGGATCGCG CAGGCCGCCG CCGGGACGGT CAACATTGGG GCCTCCGACG CCTATCTGTC 480 

GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT GATGAACATC GCGCTAGCCA TCTCCGCTCA 540 

GCAGGTCAAC TACAACCTGC CCGGAGTGAG CGAGCACCTC AAGCTGAACG GAAAAGTCCT 600 

GGCGGCCATG TACCAGGGCA CCATCAAAAC CTGGGACGAC CCGCAGATCG CTGCGCTCAA 660 

CCCCGGCGTG AACCTGCCCG GCACCGCGGT AGTTCCGCTG CACCGCTCCG ACGGGTCCGG 720 

TGACACCTTC TTGTTCACCC AGTACCTGTC CAAGCAAGAT CCCGAGGGCT GGGGCAAGTC 780 

GCCCGGCTTC GGCACCACCG TCGACTTCCC GGCGGTGCCG GGTGCGCTGG GTGAGAACGG 840 

CAACGGCGGC ATGGTGACCG GTTGCGCCGA GACACCGGGC TGCGTGGCCT ATATCGGCAT 900 

CAGCTTCCTC GACCAGGCCA GTCAACGGGG ACTCGGCGAG GCCCAACTAG GCAATAGCTC 960 

TGGCAATTTC TTGTTGCCCG ACGCGCAAAG CATTCAGGCC GCGGCGGCTG GCTTCGCATC 1020 

GAAAACCCCG GCGAACCAGG CGATTTCGAT GATCGACGGG CCCGCCCCGG ACGGCTACCC 1080 

GATCATCAAC TACGAGTACG CCATCGTCAA CAACCGGCAA AAGGACGCCG CCACCGCGCA 1140 

GACCTTGCAG GCATTTCTGC ACTGGGCGAT CACCGACGGC AACAAGGCCT CGTTCCTCGA 1200 

CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC GGTGGTGAAG TTGTCTGACG CGTTGATCGC 1260 

GACGATTTCC AGCTAGCCTC GTTGACCACC ACGCGACAGC AACCTCCGTC GGGCCATCGG 1320 

GCTGCTTTGC GGAGCATGCT GGCCCGTGCC GGTGAAGTCG GCCGCGCTGG CCCGGCCATC 1380 

CGGTGGTTGG GTGGGATAGG TGCGGTGATC CCGCTGCTTG CGCTGGTCTT GGTGCTGGTG 1440 

GTGCTGGTCA TCGAGGCGAT GGGTGCGATC AGGCTCAACG GGTTGCATTT CTTCACCGCC 1500 

ACCGAATGGA ATCCAGGCAA CACCTACGGC GAAACCGTTG TCACCGACGC GTCGCCCATC 1560 

CGGTCGGCGC CTACTACGGG GCGTTGCCGC TGATCGTCGG GACGCTGGCG ACCTCGGCAA 1620 

TCGCCCTGAT CATCGCGGTG CCGGTCTCTG TAGGAGCGGC GCTGGTGATC GTGGAACGGC 1680 

TGCCGAAACG GTTGGCCGAG GCTGTGGGAA TAGTCCTGGA ATTGCTCGCC GGAATCCCCA 1740 

GCGTGGTCGT CGGTTTGTGG GGGGCAATGA CGTTCGGGCC GTTCATCGCT CATCACATCG 1800 

CTCCGGTGAT CGCTCACAAC GCTCCCGATG TGCCGGTGCT GAACTACTTG CGCGGCGACC 1860 

CGGGCAACGG GGAGGGCATG TTGGTGTCCG GTCTGGTGTT GGCGGTGATG GTCGTTCCCA 1920 

TTATCGCCAC CACCACTCAT GACCTGTTCC GGCAGGTGCC GGTGTTGCCC CGGGAGGGCG 1980 

CGATCGGGAA TTC 1993 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Met Lys Ile Arg Leu His Thr Leu Leu Ala Val Leu Thr Ala Ala Pro 
1 5 10 15 

Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Gly Ser 
20 25 30 

Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro Ala Ser 
35 40 45 

Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu 
50 55 60 

Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr 
65 70 75 80 

Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln Ala Ala 
85 90 95 

Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly 
100 105 110 

Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala Ile Ser 
115 120 125 

Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lys 
130 135 140 

Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile Lys Thr 
145 150 155 160 

Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn Leu Pro 
165 170 175 

Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 
180 185 190 

Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly Trp Gly 
195 200 205 

Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly 
210 215 220 

Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu 
225 230 235 240 

Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp Gln Ala 
245 250 255 

Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser Gly Asn 
260 265 270 

Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala Gly Phe 
275 280 285 

Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp Gly Pro 
290 295 300 

Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile Val Asn 
305 310 315 320 

Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala Phe Leu 
325 330 335 

His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gln Val 
340 345 350 

His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp Ala Leu 
355 360 365 

Ile Ala Thr Ile Ser Ser 
370 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 

GGTCTTGACC ACCACCTGGG TGTCGAAGTC GGTGCCCGGA TTGAAGTCCA GGTACTCGTG 60 

GGTGGGGCGG GCGAAACAAT AGCGACAAGC ATGCGAGCAG CCGCGGTAGC CGTTGACGGT 120 

GTAGCGAAAC GGCAACGCGG CCGCGTTGGG CACCTTGTTC AGCGCTGATT TGCACAACAC 180 

CTCGTGGAAG GTGATGCCGT CGAATTGTGG CGCGCGAACG CTGCGGACCA GGCCGATCCG 240 

CTGCAACCCG GCAGCGCCCG TCGTCAACGG GCATCCCGTT CACCGCGACG GCTTGCCGGG 300 

CCCAACGCAT ACCATTATTC GAACAACCGT TCTATACTTT GTCAACGCTG GCCGCTACCG 360 

AGCGCCGCAC AGGATGTGAT ATGCCATCTC TGCCCGCACA GACAGGAGCC AGGCCTTATG 420 

ACAGCATTCG GCGTCGAGCC CTACGGGCAG CCGAAGTACC TAGAAATCGC CGGGAAGCGC 480 

ATGGCGTATA TCGACGAAGG CAAGGGTGAC GCCATCGTCT TTCAGCACGG CAACCCCACG 540 

TCGTCTTAGT TS-TSGGGGAA CATGATGCCG CACTTGGAAG GGCTGGGCCG GCTGGTGGCC 600 

TGCGATCTGA TCGGGATGGG CGCGTCGGAC AAGCTCAGCC CATCGGGACC CGACCGCTAT 660 

AGCTATGGCG AGCAACGAGA CTTTTTGTTC GCGCTCTGGG ATGCGCTCGA CCTCGGCGAC 720 

CACGTGGTAC TGGTGCTGCA CGACTGGGGC TCGGCGCTCG GCTTCGACTG GGCTAACCAG 780 

CATCGCGACC GAGTGCAGGG GATCGCGTTC ATGGAAGCGA TCGTCACCCC GATGACGTGG 840 

GCGGACTGGC CGCCGGCCGT GCGGGGTGTG TTCCAGGGTT TCCGATCGCC TCAAGGCGAG 900 

CCAATGGCGT TGGAGCACAA CATCTTTGTC GAACGGGTGC TGCCCGGGGC GATCCTGCGA 960 

CAGCTCAGCG ACGAGGAAAT GAACCACTAT CGGCGGCCAT TCGTGAACGG CGGCGAGGAC 1020 

CGTCGCCCCA CGTTGTCGTG GCCACGAAAC CTTCCAATCG ACGGTGAGCC CGCCGAGGTC 1080 

GTCGCGTTGG TCAACGAGTA CCGGAGCTGG CTCGAGGAAA CCGACATGCC GAAACTGTTC 1140 

ATCAACGCCG AGCCCGGCGC GATCATCACC GGCCGCATCC GTGACTATGT CAGGAGCTGG 1200 

CCCAACCAGA CCGAAATCAC AGTGCCCGGC GTGCATTTCG TTCAGGAGGA CAGCGATGGC 1260 

GTCGTATCGT GGGCGGGCGC TCGGCAGCAT CGGCGACCTG GGAGCGCTCT CATTTCACGA 1320 

GACCAAGAAT GTGATTTCCG GCGAAGGCGG CGCCCTGCTT GTCAACTCAT AAGACTTCCT 1380 

GCTCCGGGCA GAGATTCTCA GGGAAAAGGG CACCAATCGC AGCCGCTTCC TTCGCAACGA 1440 

GGTCGACAAA TATACGTGGC AGGACAAAGG TCTTCCTATT TGCCCAGCGA ATTAGTCGCT 1500 

GCCTTTCTAT GGGCTCAGTT CGAGGAAGCC GAGCGGATCA CGCGTATCCG ATTGGACCTA 1560 

TGGAACCGGT ATCATGAAAG CTTCGAATCA TTGGAACAGC GGGGGCTCCT GCGCCGTCCG 1620 

ATCATCCCAC AGGGCTGCTC TCACAACGCC CACATGTACT ACGTGTTACT AGCGCCCAGC 1680 

GCCGATCGGG AGGAGGTGCT GGCGCGTCTG ACGAGCGAAG GTATAGGCGC GGTCTTTCAT 1740 

TACGTGCCGC TTCACGATTC GCCGGCCGGG CGTCGCT 1777 

(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

GAGATTGAAT CGTACCGGTC TCCTTAGCGG CTCCGTCCCG TGAATGCCCA TATCACGCAC 60 

GGCCATGTTC TGGCTGTCGA CCTTCGCCCC ATGCCCGGAC GTTGGTAAAC CCAGGGTTTG 120 

ATCAGTAATT CCGGGGGACG GTTGCGGGAA GGCGGCCAGG ATGTGCGTGA GCCGCGGCGC 180 

CGCCGTCGCC CAGGCGACCG CTGGATGCTC AGCCCCGGTG CGGCGACGTA GCCAGCGTTT 240 

GGCGCGTGTC GTCCACAGTG GTACTCCGGT GACGACGCGG CGCGGTGCCT GGGTGAAGAC 300 

CGTGACCGAC GCCGCCGATT CAGA 324 

(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

GCGGTACCGC CGCGTTGCGC TGGCACGGGA CCTGTACGAC CTGAACCACT TCGCCTCGCG 60 

AACGATTGAC GAACCGCTCG TGCGGCGGCT GTGGGTGCTC AAGGTGTGGG GTGATGTCGT 120 

CGATGACCGG CGCGGCACCC GGCCACTACG CGTCGAAGAC GTCCTCGCCG CCCGCAGCGA 180 

GCACGACTTC CAGCCCGACT CGATCGGCGT GCTGACCCGT CCTGTCGCTA TGGCTGCCTG 240 

GGAAGCTCGC GTTCGGAAGC GATTTGCGTT CCTCACTGAC CTCGACGCCG ACGAGCAGCG 300 

GTGGGCCGCC TGCGACGAAC GGCACCGCCG CGAAGTGGAG AACGCGCTGG CGGTGCTGCG 360 

GTCCTGATCA ACCTGCCGGC GATCGTGCCG TTCCGCTGGC ACGGTTGCGG CTGGACGCGG 420 

CTGAATCGAC TAGATGAGAG CAGTTGGGCA CGAATCCGGC TGTGGTGGTG AGCAAGACAC 480 

GAGTACTGTC ATCACTATTG GATGCACTGG ATGACCGGCC TGATTCAGCA GGACCAATGG 540 

AACTGCCCGG GGCAAAACGT CTCGGAGATG ATCGGCGTCC CCTCGGAACC CTGCGGTGCT 600 

GGCGTCATTC GGACATCGGT CCGGCTCGCG GGATCGTGGT GACGCCAGCG CTGAAGGAGT 660 

GGAGCGCGGC GGTGCACGCG CTGCTGGACG GCCGGCAGAC GGTGCTGCTG CGTAAGGGCG 720 

GGATCGGCGA GAAGCGCTTC GAGGTGGCGG CCCACGAGTT CTTGTTGTTC CCGACGGTCG 780 

CGCACAGCCA CGCCGAGCGG GTTCGCCCCG AGCACCGCGA CCTGCTGGGC CCGGCGGCCG 840 

CCGACAGCAC CGACGAGTGT GTGCTACTGC GGGCCGCAGC GAAAGTTGTT GCCGCACTGC 900 

CGGTTAACCG GCCAGAGGGT CTGGACGCCA TCGAGGATCT GCACATCTGG ACCGCCGAGT 960 

CGGTGCGCGC CGACCGGCTC GACTTTCGGC CCAAGCACAA ACTGGCCGTC TTGGTGGTCT 1020 

CGGCGATCCC GCTGGCCGAG CCGGTCCGGC TGGCGCGTAG GCCCGAGTAC GGCGGTTGCA 1080 

CCAGCTGGGT GCAGCTGCCG GTGACGCCGA CGTTGGCGGC GCCGGTGCAC GACGAGGCCG 1140 

CGCTGGCCGA GGTCGCCGCC CGGGTCCGCG AGGCCGTGGG TTGACTGGGC GGCATCGCTT 1200 

GGGTCTGAGC TGTACGCCCA GTCGGCGCTG CGAGTGATCT GCTGTCGGTT CGGTCCCTGC 1260 

TGGCGTCAAT TGACGGCGCG GGCAACAGCA GCATTGGCGG CGCCATCCTC CGCGCGGCCG 1320 

GCGCCCACCG CTACAACC 1338 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

CCGGCGGCAC CGGCGGCACC GGCGGTACCG GCGGCAACGG CGCTGACGCC GCTGCTGTGG 60 

TGGGCTTCGG CGCGAACGGC GACCCTGGCT TCGCTGGCGG CAAAGGCGGT AACGGCGGAA 120 

TAGGTGGGGC CGCGGTGACA GGCGGGGTCG CCGGCGACGG CGGCACCGGC GGCAAAGGTG 180 

GCACCGGCGG TGCCGGCGGC GCCGGCAACG ACGCCGGCAG CACCGGCAAT CCCGGCGGTA 240 

AGGGCGGCGA CGGCGGGATC GGCGGTGCCG GCGGGGCCGG CGGCGCGGCC GGCACCGGCA 300 

ACGGCGGCCA TGCCGGCAAC C 321 



(2) INFORMATION FOR SEQ ID NO: 160: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

GAAGACCCGG CCCCGCCATA TCGATCGGCT CGCCGACTAC TTTCGCCGAA CGTGCACGCG 60 

GCGGCGTCGG GCTGATCATC ACCGGTGGCT ACGCGCCCAA CCGCACCGGA TGGCTGCTGC 120 

CGTTCGCCTC CGAACTCGTC ACTTCGGCGC AAGCCCGACG GCACCGCCGA ATCACCAGGG 180 

CGGTCCACGA TTCGGGTGCA AAGATCCTGC TGCAAATCCT GCACGCCGGA CGCTACGCCT 240 

ACCACCCACT TGCGGTCAGC GCCTCGCCGA TCAAGGCGCC GATCACCCCG TTTCGTCCGC 300 

GAGCACTATC GGCTCGCGGG GTCGAAGCGA CCATCGCGGA TTTCGCCCGC TGCGCGCAGT 360 

TGGCCCGCGA TGCCGGCTAC GACGGCGTCG AAATCATGGG CAGCGAAGGG TATCTGCTCA 420 

ATCAGTTCCT GGCGCCGCGC ACCAACAAGC GCACCGACTC GTGGGGCGGC ACACCGGCCA 480 

ACCGTCGCCG GT 492 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Phe Ala Gln His Leu Val Glu Gly Asp Ala Val Glu Leu Trp Arg Ala 
1 5 10 15 

Asn Ala Ala Asp Gln Ala Asp Pro Leu Gln Pro Gly Ser Ala Arg Arg 
20 25 30 

Gln Arg Ala Ser Arg Ser Pro Arg Arg Leu Ala Gly Pro Asn Ala Tyr 
35 40 45 

His Tyr Ser Asn Asn Arg Ser Ile Leu Cys Gln Arg Trp Pro Leu Pro 
50 55 60 

Ser Ala Ala Gln Asp Val Ile Cys His Leu Cys Pro His Arg Gln Glu 
65 70 75 80 

Pro Gly Leu Met Thr Ala Phe Gly Val Glu Pro Tyr Gly Gln Pro Lys 
85 90 95 

Tyr Leu Glu Ile Ala Gly Lys Arg Met Ala Tyr Ile Asp Glu Gly Lys 
100 105 110 

Gly Asp Ala Ile Val Phe Gln His Gly Asn Pro Thr Ser Ser Tyr Leu 
115 120 125 

Trp Arg Asn Ile Met Pro His Leu Glu Gly Leu Gly Arg Leu Val Ala 
130 135 140 

Cys Asp Leu Ile Gly Met Gly Ala Ser Asp Lys Leu Ser Pro Ser Gly 
145 150 155 160 

Pro Asp Arg Tyr Ser Tyr Gly Glu Gln Arg Asp Phe Leu Phe Ala Leu 
165 170 175 

Trp Asp Ala Leu Asp Leu Gly Asp His Val Val Leu Val Leu His Asp 
180 185 190 

Trp Gly Ser Ala Leu Gly Phe Asp Trp Ala Asn Gln His Arg Asp Arg 
195 200 205 

Val Gln Gly Ile Ala Phe Met Glu Ala Ile Val Thr Pro Met Thr Trp 
210 215 220 

Ala Asp Trp Pro Pro Ala Val Arg Gly Val Phe Gln Gly Phe Arg Ser 
225 230 235 240 

Pro Gln Gly Glu Pro Met Ala Leu Glu His Asn Ile Phe Val Glu Arg 
245 250 255 

Val Leu Pro Gly Ala Ile Leu Arg Gln Leu Ser Asp Glu Glu Met Asn 
260 265 270 

His Tyr Arg Arg Pro Phe Val Asn Gly Gly Glu Asp Arg Arg Pro Thr 
275 280 285 

Leu Ser Trp Pro Arg Asn Leu Pro Ile Asp Gly Glu Pro Ala Glu Val 
290 295 300 

Val Ala Leu Val Asn Glu Tyr Arg Ser Trp Leu Glu Glu Thr Asp Met 
305 310 315 320 

Pro Lys Leu Phe Ile Asn Ala Glu Pro Gly Ala Ile Ile Thr Gly Arg 
325 330 335 

Ile Arg Asp Tyr Val Arg Ser Trp Pro Asn Gln Thr Glu Ile Thr Val 
340 345 350 

Pro Gly Val His Phe Val Gln Glu Asp Ser Asp Gly Val Val Ser Trp 
355 360 365 

Ala Gly Ala Arg Gln His Arg Arg Pro Gly Ser Ala Leu Ile Ser Arg 
370 375 380 

Asp Gln Glu Cys Asp Phe Arg Arg Arg Arg Arg Pro Ala Cys Gln Leu 
385 390 395 400 

Ile Arg Leu Pro Ala Pro Gly Arg Asp Ser Gln Gly Lys Gly His Gln 
405 410 415 

Ser Gln Pro Leu Pro Ser Gln Arg Gly Arg Gln Ile Tyr Val Ala Gly 
420 425 430 

Gln Arg Ser Ser Tyr Leu Pro Ser Glu Leu Val Ala Ala Phe Leu Trp 
435 440 445 

Ala Gln Phe Glu Glu Ala Glu Arg Ile Thr Arg Ile Arg Leu Asp Leu 
450 455 460 

Trp Asn Arg Tyr His Glu Ser Phe Glu Ser Leu Glu Gln Arg Gly Leu 
465 470 475 480 

Leu Arg Arg Pro Ile Ile Pro Gln Gly Cys Ser His Asn Ala His Met 
485 490 495 

Tyr Tyr Val Leu Leu Ala Pro Ser Ala Asp Arg Glu Glu Val Leu Ala 
500 505 510 

Arg Leu Thr Ser Glu Gly Ile Gly Ala Val Phe His Tyr Val Pro Leu 
515 520 525 

His Asp Ser Pro Ala Gly Arg Arg 
530 535 

(2) INFORMATION FOR SEQ ID NO: 162: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 284 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Asn Glu Ser Ala Pro Arg Ser Pro Met Leu Pro Ser Ala Arg Pro Arg 
1 5 10 15 

Tyr Asp Ala Ile Ala Val Leu Leu Asn Glu Met His Ala Gly His Cys 
20 25 30 

Asp Phe Gly Leu Val Gly Pro Ala Pro Asp Ile Val Thr Asp Ala Ala 
35 40 45 

Gly Asp Asp Arg Ala Gly Leu Gly Val Asp Glu Gln Phe Arg His Val 
50 55 60 

Gly Phe Leu Glu Pro Ala Pro Val Leu Val Asp Gln Arg Asp Asp Leu 
65 70 75 80 

Gly Gly Leu Thr Val Asp Trp Lys Val Ser Trp Pro Arg Gln Arg Gly 
85 90 95 

Ala Thr Val Leu Ala Ala Val His Glu Trp Pro Pro Ile Val Val His 
100 105 110 

Phe Leu Val Ala Glu Leu Ser Gln Asp Arg Pro Gly Gln His Pro Phe 
115 120 125 

Asp Lys Asp Val Val Leu Gln Arg His Trp Leu Ala Leu Arg Arg Ser 
130 135 140 

Glu Thr Leu Glu His Thr Pro His Gly Arg Arg Pro Val Arg Pro Arg 
145 150 155 160 

His Arg Gly Asp Asp Arg Phe His Glu Arg Asp Pro Leu His Ser Val 
165 170 175 

Ala Met Leu Val Ser Pro Val Glu Ala Glu Arg Arg Ala Pro Val Val 
180 185 190 

Gln His Gln Tyr His Val Val Ala Glu Val Glu Arg Ile Pro Glu Arg 
195 200 205 

Glu Gln Lys Val Ser Leu Leu Ala Ile Ala Ile Ala Val Gly Ser Arg 
210 215 220 

Trp Ala Glu Leu Val Arg Arg Ala His Pro Asp Gln Ile Ala Gly His 
225 230 235 240 

Gln Pro Ala Gln Pro Phe Gln Val Arg His Asp Val Ala Pro Gln Val 
245 250 255 

Arg Arg Arg Gly Val Ala Val Leu Lys Asp Asp Gly Val Thr Leu Ala 
260 265 270 

Phe Val Asp Ile Arg His Ala Leu Pro Gly Asp Phe 
275 280 



(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 

ATGAACATGT CGTCGGTGGT GGGTCGCAAG GCCTTTGCGC GATTCGCCGG CTACTCCTCC 60 

GCCATGCACG CGATCGCCGG TTTCTCCGAT GCGTTGCGCC AAGAGCTGCG GGGTAGCGGA 120 

ATCGCCGTCT CGGTGATCCA CCCGGCGCTG ACCCAGACAC CGCTGTTGGC CAACGTCGAC 180 

CCCGCCGACA TGCCGCCGCC GTTTCGCAGC CTCACGCCCA TTCCCGTTCA CTGGGTCGCG 240 

GCAGCGGTGC TTGACGGTGT GGCG 264 

(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: 

TAGTCGGCGA CGATGACGTC GCGGTCCAGG CCGACCGCTT CAAGCACCAG CGCGACCACG 60 

AAGCCGGTGC GATCCTTACC CGCGAAGCAG TGGGTGAGCA CCGGGCGTCC GGCGGCAAGC 120 

AGTGTGACGA CACGATGTAG CGCGCGCTGT GCTCCATTGC GCGTTGGGAA TTGGCGATAC 180 

TCGTCGGTCA TGTAGCGGGT GGCCGCGTCA TTTATCGACT GGCTGGATTC GCCGGACTCG 240 

CCGTTGGACC CGTCATTGGT TAGCAGCCTC TTGAATGCGG TTTCGTGCGG CGCTGAGTCG 300 

TCGGCGTCAT CATCGGCGAG GTCGGGGAAC GGCAGCAGGT GGACGTCGAT GCCGTCCGGA 360 

ACCCGTCCTG GACCGCGGCG GGCAACCTCC CGGGACGACC GCAGGTCGGC AACGTCGGTG 420 

ATCCCCAGCC GGCGCAGCGT TGCCCCTCGT GCCGAATTCG GCACGAGGCT GGCGAGCCAC 480 

CGGGCATCAC CAAGCAACGC TTGCCCAGTA CGGATCGTCA CTTCCGCATC CGGCAGACCA 540 

ATCTCCTCGC CGCCCATCGT CAGATCCCGC TCGTGCGTTG ACAAGAACGG CCGCAGATGT 600 

GCCAGCGGGT ATCGGAGATT GAACCGCGCA CGCAGTTCTT CAATCGCTGC GCGCTGCCGC 660 

ACTATTGGCA CTTTCCGGCG GTCGCGGTAT TCAGCAAGCA TGCGAGTCTC GACGAACTCG 720 

CCCCACGTAA CCCACGGCGT AGCTCCCGGC GTGACGCGGA GGATCGGCGG GTGATCTTTG 780 

CCGCCACGCT CGTAGCCGTT GATCCACCGC TTCGCGGTGC CGGCGGGGAG GCCGATCAGC 840 

TTATCGACCT CGGCGTATGC CGACGGCAAG CTGGGCGCGT TCGTCGAGGT CAAGAACTCC 900 

ACCATCGGCA CCGGCACCAA GGTGCCGCAC CTGACCTACG TCGGCGACGC CGACATCGGC 960 

GAGTACAGCA ACATCGGCGC CTCCAGCGTG TTCGTCAACT ACGACGGTAC GTCCAAACGG 1020 

CGCACCACCG TCGGTTCGCA CGTACGGACC GGGTCCGACA CCATGTTCGT GGCCCCAGTA 1080 

ACCATCGGCG ACGGCGCGTA TACCGGGGCC GGCACAGTGG TGCGGGAGGA TGTCCCGCCG 1140 

GGGGCGCTGG CAGTGTCGGC GGGTCCGCAA C 1171 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 

GCAAAGGCGG CACCGGCGGG GCCGGCATGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 60 

ACGGCGGCCA AGGCGGCACC GGCGGCACCG GCGGCAACGC CGGCGCCGGC GGCACCAGCT 120 

TCACCCAAGG CGCCGACGGC AACGCCGGCA ACGGCGGTGA CGGCGGGGTC GGCGGCAACG 180 

GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCC 227 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

CCTCGCCACC ATGGGCGGGC AGGGCGGTAG CGGTGGCGCC GGCTCTACCC CAGGCGCCAA 60 

GGGCGCCCAC GGCTTCACTC CAACCAGCGG CGGCGACGGC GGCGACGGCG GCAACGGCGG 120 

CAACTCCCAA GTGGTCGGCG GCAACGGCGG CGACGGCGGC AATGGCGGCA ACGGCGGCAG 180 

CGCCGGCACG GGCGGCAACG GCGGCCGCGG CGGCGACGGC GCGTTTGGTG GCATGAGTGC 240 

CAACGCCACC AACCCTGGTG AAAACGGGCC AAACGGTAAC CCCGGCGGCA ACGGTGGCGC 300 

CGGC 304 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1439 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

GTGGGACGCT GCCGAGGCTG TATAACAAGG ACAACATCGA CCAGCGCCGG CTCGGTGAGC 60 

TGATCGACCT ATTTAACAGT GCGCGCTTCA GCCGGCAGGG CGAGCACCGC GCCCGGGATC 120 

TGATGGGTGA GGTCTACGAA TACTTCCTCG GCAATTTCGC TCGCGCGGAA GGGAAGCGGG 180 

GTGGCGAGTT CTTTACCCCG CCCAGCGTGG TCAAGGTGAT CGTGGAGGTG CTGGAGCCGT 240 

CGAGTGGGCG GGTGTATGAC CCGTGCTGCG GTTCCGGAGG CATGTTTGTG CAGACCGAGA 300 

AGTTCATCTA CGAACACGAC GGCGATCCGA AGGATGTCTC GATCTATGGC CAGGAAAGCA 360 

TTGAGGAGAC CTGGCGGATG GCGAAGATGA ACCTCGCCAT CCACGGCATC GACAACAAGG 420 

GGCTCGGCGC CCGATGGAGT GATACCTTCG CCCGCGACCA GCACCCGGAC GTGCAGATGG 480 

ACTACGTGAT GGCCAATCCG CCGTTCAACA TCAAAGACTG GGCCCGCAAC GAGGAAGACC 540 

CACGCTGGCG CTTCGGTGTT CCGCCCGCCA ATAACGCCAA CTACGCATGG ATTCAGCACA 600 

TCCTGTACAA CTTGGCGCCG GGAGGTCGGG CGGGCGTGGT GATGGCCAAC GGGTCGATGT 660 

CGTCGAACTC CAACGGCAAG GGGGATATTC GCGCGCAAAT CGTGGAGGCG GATTTGGTTT 720 

CCTGCATGGT CGCGTTACCC ACCCAGCTGT TCCGCAGCAC CGGAATCCCG GTGTGCCTGT 780 

GGTTTTTCGC CAAAAACAAG GCGGCAGGTA AGCAAGGGTC TATCAACCGG TGCGGGCAGG 840 

TGCTqgJCAT CGACGCTCGT GAACTGGGCG ACCTAGTGGA CCGGGCCGAG CGGGCGCTGA 900 

CCAACGAGGA GATCGTCCGC ATCGGGGATA CCTTCCACGC GAGCACGACC ACCGGCAACG 960 

CCGGCTCCGG TGGTGCCGGC GGTAATGGGG GCACTGGCCT CAACGGCGCG GGCGGTGCTG 1020 

GCGGGGCCGG CGGCAACGCG GGTGTCGCCG GCGTGTCCTT CGGCAACGCT GTGGGCGGCG 1080 

ACGGCGGCAA CGGCGGCAAC GGCGGCCACG GCGGCGACGG CACGACGGGC GGCGCCGGCG 1140 

GCAAGGGCGG CAACGGCAGC AGCGGTGCCG CCAGCGGCTC AGGCGTCGTC AACGTCACCG 1200 

CCGGCCACGG CGGCAACGGC GGCAATGGCG GCAACGGCGG CAACGGCTCC GCGGGCGCCG 1260 

GCGGCCAGGG CGGTGCCGGC GGCAGCGCCG GCAACGGCGG CCACGGCGGC GGTGCCACCG 1320 

GCGGCGCCAG CGGCAAGGGC GGCAACGGCA CCAGCGGTGC CGCCAGCGGC TCAGGCGTCA 1380 

TCAACGTCAC CGCCGGCCAC GGCGGCAACG GCGGCAATGG CCGCAACGGC GGCAACGGC 1439 

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

GGGCCGGCGG GGCCGGATTT TCTCGTGCCT TGATTGTCGC TGGGGATAAC GGCGGTGATG 60 

GTGGTAACGG CGGGATGGGC GGGGCTGGCG GGGCTGGCGG CCCCGGCGGG GCCGGCGGCC 120 

TGATCAGCCT GCTGGGCGGC CAAGGCGCCG GCGGGGCCGG CGGGACCGGC GGGGCCGGCG 180 

GTGTTGGCGG TGACGGCGGG GCCGGCGGCC CCGGCAACCA GGCCTTCAAC GCAGGTGCCG 240 

GCGGGGCCGG CGGCCTGATC AGCCTGCTGG GCGGCCAAGG CGCCGGCGGG GCCGGCGGGA 300 

CCGGCGGGGC CGGCGGTGTT GGCGGTGAC 329 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

GCAACGGTGG CAACGGCGGC ACCAGCACGA CCGTGGGGAT GGCCGGAGGT AACTGTGGTG 60 

CCGCCGGGCT GATCGGCAAC 80 

(2) INFORMATION FOR SEQ ID NO: 170: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

GGGCTGTGTC GCACTCACAC CGCCGCATTC GGCGACGTTG GCCGCCCAAT ATCCAGCTCA 60 

AGGCCTACTA CTTACCGTCG GAGGACCGCC GCATCAAGGT GCGGGTCAGC GCCCAAGGAA 120 

TCAAGGTCAT CGACCGCGAC GGGCATCGAG GCCGTCGTCG CGCGGCTCGG GCAGGATCCG 180 

CCCCGGCGCA CTTCGCGCGC CAAGCGGGCT CATCGCTCCG AACGGCGGCG ATCCTGTGAG 240 

CACAACTGAT GGCGCGCAAC GAGATTCGTC CAATTGTCAA GCCGTGTTCG ACCGCAGGGA 300 
CCGGTTATAC GTATGTCAAC CTATGTCACT CGCAAGAACC GGCATAACGA TCCCGTGATC 360 
CGCCGACAGC CCACGAGTGC AAGACCGTTA CA 392 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

ACCGGCGCCA CCGGCGGCAC CGGGTTCGCC GGTGGCGCCG GCGGGGCCGG CGGGCAGGGC 60 

GGTATCAGCG GTGCCGGCGG CACCAACGGC TCTGGTGGCG CTGGCGGCAC CGGCGGACAA 120 

GGCGGCGCCG GGGGCGCTGG CGGGGCCGGC GCCGATAACC CCACCGGCAT CGGCGGCGCC 180 

GGCGGCACCG GCGGCACCGG CGGAGCGGCC GGAGCCGGCG GGGCCGGTGG CGCCATCGGT 240 

ACCGGCGGCA CCGGCGGCGC GGTGGGCAGC GTCGGTAACG CCGGGATCGG CGGTACCGGC 300 

GGTACGGGTG GTGTCGGTGG TGCTGGTGGT GCAGGTGCGG CTGCGGCCGC TGGCAGCAGC 360 

GCTACCGGTG GCGCCGGGTT CGCCGGCGGC GCCGGCGGAG AAGGCGGACC GGGCGGCAAC 420 

AGCGGTGTGG GCGGCACCAA CGGCTCCGGC GGCGCCGGCG GTGCAGGCGG CAAGGGCGGC 480 

ACCGGAGGTG CCGGCGGGTC CGGCGCGGAC AACCCCACCG GTGCTGGTTT CGCCG 535 
(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

CCGACGTCGC CGGGGCGATA CGGGGGTCAC CGACTACTAC ATCATCCGCA CCGAGAATCG 60 

GCCGCTGCTG CAACCGCTGC GGGCGGTGCC GGTCATCGGA GATCCGCTGG CCGACCTGAT 120 

CCAGCCGAAC CTGAAGGTGA TCGTCAACCT GGGCTACGGC GACCCGAACT ACGGCTACTC 180 

GACGAGCTAC GCCGATGTGC GAACGCCGTT CGGGCTGTGG CCGAACGTGC CGCCTCAGGT 240 

CATCGCCGAT GCCCTGGCCG CCGGAACACA AGAAGGCATC CTTGACTTCA CGGCCGACCT 300 

GCAGGCGCTG TCCGCGCAAC CGCTCACGCT CCCGCAGATC CAGCTGCCGC AACCCGCCGA 360 

TCTGGTGGCC GCGGTGGCCG CCGCACCGAC GCCGGCCGAG GTGGTGAACA CGCTCGCCAG 420 

GATCATCTCA ACCAACTACG CCGTCCTGCT GCCCACCGTG GACATCGCCC TCGCCTGGTC 480 

ACCACCCTGC CGCTGTACAC CACCCAACTG TTCGTCAGGC AACTCGCTGC GGGCAATCTG 540 

ATCAACGCGA TCGGCTATCC CCTGGCGGCC ACCGTAGGTT TAGGCACGAT CGATAGCGGG 600 

CGGCGTGGAA TTGCTCACCC TCCTCGCGGC GGCCTCGGAC ACCGTTCGAA ACATCGAGGG 660 

CCTCGTCACC TAACGGATTC CCGACGGCAT 690
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 407 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:

ACGGTGACGG CGGTACTGGC GGCGGCCACG GCGGCAACGG CGGGAATCCC GGGTGGCTCT 60

TGGGCACAGC CGGGGGTGGC GGCAACGGTG GCGCCGGCAG CACCGGTACT GCAGGTGGCG 120

GCTCTGGGGG CACCGGCGGC GACGGCGGGA CCGGCGGGCG TGGCGGCCTG TTAATGGGCG 180 

CCGGCGCCGG CGGGCACGGT GGCACTGGCG GCGCGGGCGG TGCCGGTGTC GACGGTGGCG 240 

GCGCCGGCGG GGCCGGCGGG GCCGGCGGCA ACGGCGGCGC CGGGGGTCAA GCCGCCCTGC 300 

TGTTCGGGCG CGGCGGCACC GGCGGAGCCG GCGGCTACGG CGGCGATGGC GGTGGCGGCG 360 

GTGACGGCTT CGACGGCACG ATGGCCGGCC TGGGTGGTAC CGGTGGC 407 
(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 468 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:174: 

GATCGGTCAG CGCATCGCCC TCGGCGGCAA GCGATTCCGC GGTCTCACCG AAGAACATCG 60

TGCACGCGGC GGCGCGGACC AGCCCGCTGC GCTGCGGCGC GTCGAACGCC TCCAGCAGGC 120

ACAGCCAGTC CTTGGCGGCC TGCGAGGCGA ACACGTCGGT GTCACCGGTG TAGATCGCCG 180

GGATGCCCGC CTCCGCCAAC GCATTCCGGC ACGCCCGCGC GTCTTTGTGA TGCTCGACGA 240

TCACCGCGAT GTCTGCGGCC ACCACGGGCC GCCCGGCGAA GGTGGCCCCG CTGGCCAGTA 300 

GCGCCGCGAC GTCGGCGGCC AGGTCGTCGG GGATGTGCCG GCGCAGCGCT CCGGCGCGAC 360 

GCCCGAAAAA CGACCCCTCA CCCAGCTGGG TCCCGCTGGC ATATCCCTTG CCGTCCTGGG 420 

CGATATTGGA CGCGCATGCC CCGACCGCGT ACAGGCCGGC CACCACCG 468 
(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 
GGTGGTAACG GCGGCCAGGG TGGCATCGGC GGCGCCGGCG AGAGAGGCGC CGACGGCGCC 60 



GGCCCCAATG CTAACGGCGC AAACGGCGAG AACGGCGGTA GCGGTGGTAA CGGTGGCGAC 120

GGCGGCGCCG GCGGCAATGG CGGCGCGGGC GGCAACGCGC AGGCGGCCGG GTACACCGAC 180

GGCGCCACGG GCACCGGCGG CGACGGCGGC AACGGCGGC 219
(2) INFORMATION FOR SEQ ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 
TAGCTCCGGC GAGGGCGGCA AGGGCGGCGA CGGTGGCCAC GGCGGTGACG GCGTCGGCGG 60

CAACAGTTCC GTCACCCAAG GCGGCAGCGG CGGTGGCGGC GGCGCCGGCG GCGCCGGCGG 120

CAGCGGCTTT TTCGGCGGCA AGGGCGGCTT CGGCGGCGAC GGCGGTCAGG GCGGCCCCAA 180

CGGCGGCGGT ACCGTCGGCA CCGTGGCCGG TGGCGGCGGC AACGGCGGTG TCGGCGGCCG 240

GGGCGGCGAC GGCGTCTTTG CCGGTGCCGG CGGCCAGGGC GGCCTCGGTG GGCAGGGCGG 300

CAATGGCGGC GGCTCCACCG GCGGCAACGG CGGCCTTGGC GGCGCGGGCG GTGGCGGAGG 360

CAACGCCCCG GCTCGTGCCG AATCCGGGCT GACCATGGAC AGCGCGGCCA AGTTCGCTGC 420

CATCGCATCA GGCGCGTACT GCCCCGAACA CCTGGAACAT CACCCGAGTT AGCGGGGCGC 480

ATTTCCTGAT CACC 494

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

GGGCCGGTGG TGCCGCGGGC CAGCTCTTCA GCGCCGGAGG CGCGGCGGGT GCCGTTGGGG 60

TTGGCGGCAC CGGCGGCCAG GGTGGGGCTG GCGGTGCCGG AGCGGCCGGC GCCGACGCCC 120

CCGCCAGCAC AGGTCTAACC GGTGGTACCG GGTTCGCTGG CGGGGCCGGC GGCGTCGGCG 180

GCCAGAGCGG CAACGCCATT GCCGGCGGCA TCAACGGCTC 220
(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:178: 

ATGGCGGCAA CGGGGGCCCC GGCGGTGCTG GCGGGGCCGG CGACTACAAT TTCCAACGGC 60 

GGGCAGGGTG GTGCCGGCGG CCAAGGCGGC CAAGGCGGCC TGGGCGGGGC AAGCACCACC 120

TGATCGGCCT AGCCGCACCC GGGAAAGCCG ATCCAACAGG CGACGATGCC GCCTTCCTTG 180 

CCGCGTTGGA CCAGGCCGGC ATCACCTACG CTGACCCAGG CCACGCCATA ACGGCCGCCA 240 

AGGCGATGTG TGGGCTGTGT GCTAACGGCG TAACAGGTCT ACAGCTGGTC GCGGACCTGC 300 

GGGACTACAA TCCCGGGCTG ACCATGGACA GCGCGGCCAA GTTCGCTGCC ATCGCATCAG 360 

GCGCGTACTG CCCCGAACAC CTGGAACA 388 
(2) INFORMATION FOR SEQ ID NO: 179: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

GCAAAGGCGG CACCGGCGGG GCCGGCATGA ACAGCCTCGA CCCGCTGCTA GCCGCCCAAG 60

ACGGCGGCCA AGGCGGCACC GGCGGCACCG GCGGCAACGC CGGCGCCGGC GGCACCAGCT 120

TCACCCAAGG CGCCGACGGC AACGCCGGCA ACGGCGGTGA CGGCGGGGTC GGCGGCAACG 180 

GCGGAAACGG CGGAAACGGC GCAGACAACA CCACCACCGC CGCCGCCGGC ACCACAGGCG 240 

GCGACGGCGG GGCCGGCGGG GCCGGCGGAA CCGGCGGAAC CGGCGGAGCC GCCGGCACCG 300 

GCACCGGCGG CCAACAAGGC AACGGCGGCA ACGGCGGCAC CGGCGGCAAA GGCGGCACCG 360 

GCGGCGACGG TGCACTCTCA GGCAGCACCG GTGGTGCCGG 400 
(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
GGCAACGGCG GCAACGGCGG CATCGCCGGC ATTGGGCGGC AACGGCGTTC CGGGACGGGC 60

AGCGGCAACG GCGGCCAACG GCGGCAGCGG CGGCAACGGC GGCAACGCCG GCATGGGCGG 120

CAACAGCGGC ACCGGCAGCG GCGACGGCGG TGCCGGCGGG AACGGCGGCG CGGCGGGCAC 180

GGGCGGCACC GGCGGCGACG GCGGCCTCAC CGGTACTGGC GGCACCGGCG GCAGCGGTGG 240

CACCGGCGGT GACGGCGGTA ACGGCGGCAA CGGAGCAGAT AACACCGCAA ACATGACTGC 300

GCAGGCGGGC GGTGACGGTG GCAACGGCGG CGACGGTGGC TTCGGCGGCG GGGCCGGGGC 360

CGGCGGCGGT GGCTTGACCG CTGGCGCCAA CGGCACCGGC GGGCAAGGCG GCGCCGGCGG 420

CGATGGCGGC AACGGGGCCA TCGGCGGCCA CGGCCCACTC ACTGACGACC CCGGCGGCAA 480

CGGGGGCACC GGCGGCAACG GCGGCACCGG CGGCACCGGC GGCGCGGGCA TCGGCAGC 538
(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 
GGGCCGGTGG TGCCGCGGGC CAGCTCTTCA GCGCCGGAGG CGCGGCGGGT GCCGTTGGGG 60

TTGGCGGCAC CGGCGGCCAG GGTGGGGCTG GCGGTGCCGG AGCGGCCGGC GCCGACGCCC 120

CCGCCAGCAC AGGTCTAACC GGTGGTACCG GGTTCGCTGG CGGGGCCGGC GGCGTCGGCG 180

GCCACGGCGG CAACGCCATT GCCGGCGGCA TCAACGGCTC CGGTGGTGCC GGCGGCACC 239
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182:

AGCAGCGCTA CCGGTGGCGC CGGGTTCGCC GGCGGCGCCG GCGGAGAAGG CGGAGCGGGC 60

GGCAACAGCG GTGTGGGCGG CACCAACGGC TCCGGCGGCG CCGGCGGTGC AGGCGGCAAG 120

GGCGGCACCG GAGGTGCCGG CGGGTCCGGC GCGGACAACC CCACCGGTGC TGGTTTCGCC 180

GGTGGCGCCG GCGGCACAGG TGGCGCGGCC GGCGCCGGCG GGGCCGGCGG GGCGACCGGT 240

ACCGGCGGCA CCGGCGGCGT TGTCGGCGCC ACCGGTAGTG CAGGCATCGG CGGGGCCGGC 300

GGCCGCGGCG GTGACGGCGG CGATGGGGCC AGCGGTCTCG GCCTGGGCCT CTCCGGCTTT 360

GACGGCGGCC AAGGCGGCCA AGGCGGGGCC GGCGGCAGCG CCGGCGCCGG CGGCATCAAC 420

GGGGCCGGCG GGGCCGGCGG CAACGGCGGC GACGGCGGGG ACGGCGCAAC CGGTGCCGCA 480 

GGTCTCGGCG ACAACGGCGG GGTCGGCGGT GACGGTGGGG CCGGTGGCGC CGCCGGCAAC 540 

GGCGGCAACG CGGGCGTCGG CCTGACAGCC AAGGCCGGCG ACGGCGGCGC CGCGGGCAAT 600 

GGCGGCAACG GGGGCGCCGG CGGTGCTGGC GGGGCCGGCG ACAACAATTT CAACGGCGGC 660 

CAGGGTGGTG CCGGCGGCCA AGGCGGCCAA GGCGGCTTGG GCGGGGCAAG CACCACCTGA 720 

TCGGCCTAGC CGCACCCGGG AAAGCCGATC CAACAGGCGA CGATGCCGCC TTCCTTGCCG 780 

CGTTGGACCA GGCCGGCATC ACCTACGCTG ACCCAGGCCA CGCCATAACG GCCGCCAAGG 840 

CGATGTGTGG GCTGTGTGCT AACGGCGTAA CAGGTCTACA GCTGGTCGCG GACCTGCGGG 900

AATACAATCC CGGGCTGACC ATGGACAGCG CGGCCAAGTT CGCTGCCATC GCATCAGGCG 960

CGTACTGCCC CGAACACCTG GAACA 985

(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2138 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 

CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 60 

CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGCCGGC 120 

ATGAACGGGC GGCATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGCGACGA TAATGGCTAT 180

AGCACTAAGG AGGATGATCC GATATGACGC AGTCGCAGAC CGTGACGGTG GATCAGCAAG 240 

AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 300 

CCATCACACC GTGCGAACTC ACGGCGGCTA AAAACGCCGC CCAACAGCTG GTATTGTCCG 360 

CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT 420

CGCTGCGCAA CGCGGCCAAG GCGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 480

ACAACGACGG CGAAGGAACT GTGCAGGCAG AATCGGCCGG GGCCGTCGGA GGGGACAGTT 540 

CGGCCGAACT AACCGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 600 

TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTTTG 660

CGGATGGGTG GAACACTTTC AACCTGACGC TGCAAGGCGA CGTCAAGCGG TTCCGGGGGT 720

TTGACAACTG GGAAGGCGAT GCGGCTACCG CTTGCGAGGC TTCGCTCGAT CAACAACGGC 780 

AATGGATACT CCACATGGCC AAATTGAGCG CTGCGATGGC CAAGCAGGCT CAATATGTCG 840

CGCAGCTGCA CGTGTGGGCT AGGCGGGAAC ATCCGACTTA TGAAGACATA GTCGGGCTCG 900 

AACGGCTTTA CGCGGAAAAC CCTTCGGCCC GCGACCAAAT TCTCCCGGTG TACGCGGAGT 960 

ATCAGCAGAG GTCGGAGAAG GTGCTGACCG AATACAACAA CAAGGCAGCC CTGGAACCGG 1020 

TAAACCCGCC GAAGCCTCCC CCCGCCATCA AGATCGACCC GCCCCCGCCT CCGCAAGAGC 1080 

AGGGATTGAT CCCTGGCTTC CTGATGCCGC CGTCTGACGG CTCCGGTGTG ACTCCCGGTA 1140 

CCGGGATGCC AGCCGCACCG ATGGTTCCGC CTACCGGATC GCCGGGTGGT GGCCTCCCGG 1200 

CTGACACGGC GGCGCAGCTG ACGTCGGCTG GGCGGGAAGC CGCAGCGCTG TCGGGCGACG 1260

TGGCGGTCAA AGCGGCATCG CTCGGTGGCG GTGGAGGCGG CGGGGTGCCG TCGGCGCCGT 1320

TGGGATCCGC GATCGGGGGC GCCGAATCGG TGCGGCCCGC TGGCGCTGGT GACATTGCCG 1380

GCTTAGGCCA GGGAAGGGCC GGCGGCGGCG CCGCGCTGGG CGGCGGTGGC ATGGGAATGC 1440

CGATGGGTGC CGCGCATCAG GGACAAGGGG GCGCCAAGTC CAAGGGTTCT CAGCAGGAAG 1500

ACGAGGCGCT CTACACCGAG GATCGGGCAT GGACCGAGGC CGTCATTGGT AACCGTCGGC 1560

GCCAGGACAG TAAGGAGTCG AAGTGAGCAT GGACGAATTG GACCCGCATG TCGCCCGGGC 1620

GTTGACGCTG GCGGCGCGGT TTCAGTCGGC CCTAGACGGG ACGCTCAATC AGATGAACAA 1680 

CGGATCCTTC CGCGCCACCG ACGAAGCCGA GACCGTCGAA GTGACGATCA ATGGGCACCA 1740 

GTGGCTCACC GGCCTGCGCA TCGAAGATGG TTTGCTGAAG AAGCTGGGTG CCGAGGCGGT 1800 

GGCTCAGCGG GTCAACGAGG CGCTGCACAA TGCGCAGGCC GCGGCGTCCG CGTATAACGA 1860 

CGCGGCGGGC GAGCAGCTGA CCGCTGCGTT ATCGGCCATG TCCCGCGCGA TGAACGAAGG 1920

AATGGCCTAA GCCCATTGTT GCGGTGGTAG CGACTACGCA CCGAATGAGC GCCGCAATGC 1980 

GGTCATTCAG CGCGCCCGAC ACGGCGTGAG TACGCATTGT CAATGTTTTG ACATGGATCG 2040 

GCCGGGTTCG GAGGGCGCCA TAGTCCTGGT CGCCAATATT GCCGCAGCTA GCTGGTCTTA 2100

GGTTCGGTTA CGCTGGTTAA TTATGACGTC CGTTACCA 2138 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

Met Thr Gln Ser Gln Thr Val Thr Val Asp Gln Gln Glu Ile Leu Asn
1 5 10 15

Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val
20 25 30

Pro Ile Thr Pro Cys Glu Leu Thr Ala Ala Lys Asn Ala Ala Gln Gln
35 40 45

Leu Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala
50 55 60

Lys Glu Arg Gln Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Ala
65 70 75 80

Tyr Gly Glu Val Asp Glu Glu Ala Ala Thr Ala Leu Asp Asn Asp Gly
85 90 95

Glu Gly Thr Val Gln Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser
100 105 110

Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro
115 120 125

Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp
130 135 140

Gln Gly Ala Ser Leu Ala His Phe Ala Asp Gly Trp Asn Thr Phe Asn
145 150 155 160

Leu Thr Leu Gln Gly Asp Val Lys Arg Phe Arg Gly Phe Asp Asn Trp
165 170 175

Glu Gly Asp Ala Ala Thr Ala Cys Glu Ala Ser Leu Asp Gln Gln Arg
180 185 190

Gln Trp Ile Leu His Met Ala Lys Leu Ser Ala Ala Met Ala Lys Gln
195 200 205

Ala Gln Tyr Val Ala Gln Leu His Val Trp Ala Arg Arg Glu His Pro
210 215 220

Thr Tyr Glu Asp Ile Val Gly Leu Glu Arg Leu Tyr Ala Glu Asn Pro
225 230 235 240

Ser Ala Arg Asp Gln Ile Leu Pro Val Tyr Ala Glu Tyr Gln Gln Arg
245 250 255

Ser Glu Lys Val Leu Thr Glu Tyr Asn Asn Lys Ala Ala Leu Glu Pro
260 265 270

Val Asn Pro Pro Lys Pro Pro Pro Ala Ile Lys Ile Asp Pro Pro Pro
275 280 285

Pro Pro Gln Glu Gln Gly Leu Ile Pro Gly Phe Leu Met Pro Pro Ser
290 295 300

Asp Gly Ser Gly Val Thr Pro Gly Thr Gly Met Pro Ala Ala Pro Met 
305 310 315 320 

Val Pro Pro Thr Gly Ser Pro Gly Gly Gly Leu Pro Ala Asp Thr Ala 
325 330 335 

Ala Gln Leu Thr Ser Ala Gly Arg Glu Ala Ala Ala Leu Ser Gly Asp
340 345 350

Val Ala Val Lys Ala Ala Ser Leu Gly Gly Gly Gly Gly Gly Gly Val
355 360 365

Pro Ser Ala Pro Leu Gly Ser Ala Ile Gly Gly Ala Glu Ser Val Arg
370 375 380

Pro Ala Gly Ala Gly Asp Ile Ala Gly Leu Gly Gln Gly Arg Ala Gly
385 390 395 400

Gly Gly Ala Ala Leu Gly Gly Gly Gly Met Gly Met Pro Met Gly Ala
405 410 415

Ala His Gln Gly Gln Gly Gly Ala Lys Ser Lys Gly Ser Gln Gln Glu
420 425 430

Asp Glu Ala Leu Tyr Thr Glu Asp Arg Ala Trp Thr Glu Ala Val Ile
435 440 445

Gly Asn Arg Arg Arg Gln Asp Ser Lys Glu Ser Lys
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185:

Ala Gly Asn Val Thr Ser Ala Ser Gly Pro His Arg Phe Gly Ala Pro
1 5 10 15

Asp Arg Gly Ser Gln Arg Arg Arg Arg His Pro Ala Ala Ser Thr Ala
20 25 30

Thr Glu Arg Cys Arg Phe Asp Arg His Val Ala Arg Gln Arg Cys Gly
35 40 45

Phe Pro Pro Ser Arg Arg Gln Leu Arg Arg Arg Val Ser Arg Glu Ala
50 55 60

Thr Thr Arg Arg Ser Gly Arg Arg Asn His Arg Cys Gly Trp His Pro
65 70 75 80

Gly Thr Gly Ser His Thr Gly Ala Val Arg Arg Arg His Gln Glu Ala
85 90 95

Arg Asp Gln Ser Leu Leu Leu Arg Arg Arg Gly Arg Val Asp Leu Asp
100 105 110

Gly Gly Gly Arg Leu Arg Arg Val Tyr Arg Phe Gln Gly Cys Leu Val
115 120 125

Val Val Phe Gly Gln His Leu Leu Arg Pro Leu Leu Ile Leu Arg Val
130 135 140

His Arg Glu Asn Leu Val Ala Gly Arg Arg Val Phe Arg Val Lys Pro
145 150 155 160

Phe Glu Pro Asp Tyr Val Phe Ile Ser Arg Met Phe Pro Pro Ser Pro
165 170 175

His Val Gln Leu Arg Asp Ile Leu Ser Leu Leu Gly His Arg Ser Ala
180 185 190

Gln Phe Gly His Val Glu Tyr Pro Leu Pro Leu Leu Ile Glu Arg Ser
195 200 205

Leu Ala Ser Gly Ser Arg Ile Ala Phe Pro Val Val Lys Pro Pro Glu
210 215 220

Pro Leu Asp Val Ala Leu Gln Arg Gln Val Glu Ser Val Pro Pro Ile
225 230 235 240

Arg Lys Val Arg Glu Arg Cys Ala Leu Val Ala Arg Phe Glu Leu Pro
245 250 255

Cys Arg Phe Phe Glu Ile His Glu Val Gly Phe Thr Gly Arg Gly His
260 265 270

Pro Arg Arg Ile Gly
275 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

Arg Val Ala Ala Ser Phe Ile Asp Trp Leu Asp Ser Pro Asp Ser Pro
1 5 10 15

Leu Asp Pro Ser Leu Val Ser Ser Leu Leu Asn Ala Val Ser Cys Gly
20 25 30

Ala Glu Ser Ser Ala Ser Ser Ser Ala Arg Ser Gly Asn Gly Ser Arg
35 40 45

Trp Thr Ser Met Pro Ser Gly Thr Arg Pro Gly Pro Arg Arg Ala Thr
50 55 60

Ser Arg Asp Asp Arg Arg Ser Ala Thr Ser Val Ile Pro Ser Arg Arg
65 70 75 80

Ser Val Ala Pro Arg Ala Glu Phe Gly Thr Arg Leu Ala Ser His Arg
85 90 95

Ala Ser Pro Ser Asn Ala Cys Pro Val Arg Ile Val Thr Ser Ala Ser
100 105 110

Gly Arg Pro Ile Ser Ser Pro Pro Ile Val Arg Ser Arg Ser Cys Val
115 120 125

Asp Lys Asn Gly Arg Arg Cys Ala Ser Gly Tyr Arg Arg Leu Asn Arg
130 135 140

Ala Arg Ser Ser Ser Ile Ala Ala Arg Cys Arg Thr Ile Gly Thr Phe
145 150 155 160

Arg Arg Ser Arg Tyr Ser Ala Ser Met Arg Val Ser Thr Asn Ser Pro
165 170 175

His Val Thr His Gly Val Ala Pro Gly Val Thr Arg Arg Ile Gly Gly
180 185 190 



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

Gln Glu Arg Pro Gln Met Cys Gln Arg Val Ser Glu Ile Glu Pro Arg
1 5 10 15

Thr Gln Phe Phe Asn Arg Cys Ala Leu Pro His Tyr Trp His Phe Pro
20 25 30

Ala Val Ala Val Phe Ser Lys His Ala Ser Leu Asp Glu Leu Ala Pro
35 40 45

Arg Asn Pro Arg Arg Ser Ser Arg Arg Asp Ala Glu Asp Arg Arg Val
50 55 60

Ile Phe Ala Ala Thr Leu Val Ala Val Asp Pro Pro Leu Arg Gly Ala
65 70 75 80

Gly Gly Glu Ala Asp Gln Leu Ile Asp Leu Gly Val Cys Arg Arg Gln
85 90 95

Ala Gly Arg Val Arg Arg Gly Gln Glu Leu His His Arg His Arg His
100 105 110

Gln Gly Ala Ala Pro Asp Leu Arg Arg Arg Arg Arg His Arg Arg Val
115 120 125

Gln Gln His Arg Arg Leu Gln Arg Val Arg Gln Leu Arg Arg Tyr Val
130 135 140

Gln Thr Ala His His Arg Arg Phe Ala Arg Thr Asp Arg Val Arg His
145 150 155 160

His Val Arg Gly Pro Ser Asn His Arg Arg Arg Arg Val Tyr Arg Gly
165 170 175

Arg His Ser Gly Ala Gly Gly Cys Pro Ala Gly Gly Ala Gly Ser Val
180 185 190

Gly Gly Ser Ala
195 

(2) INFORMATION FOR SEQ ID NO:188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Val Arg Cys Gly Thr Leu Val Pro Val Pro Met Val Glu Phe Leu Thr 
1 5 10 15 

Ser Thr Asn Ala Pro Ser Leu Pro Ser Ala Tyr Ala Glu Val Asp Lys 
20 25 30 

Leu Ile Gly Leu Pro Ala Gly Thr Ala Lys Arg Trp Ile Asn Gly Tyr
35 40 45

Glu Arg Gly Gly Lys Asp His Pro Pro Ile Leu Arg Val Thr Pro Gly
50 55 60

Ala Thr Pro Trp Val Thr Trp Gly Glu Phe Val Glu Thr Arg Met Leu
65 70 75 80

Ala Glu Tyr Arg Asp Arg Arg Lys Val Pro Ile Val Arg Gln Arg Ala
85 90 95

Ala Ile Glu Glu Leu Arg Ala Arg Phe Asn Leu Arg Tyr Pro Leu Ala
100 105 110

His Leu Arg Pro Phe Leu Ser Thr His Glu Arg Asp Leu Thr Met Gly
115 120 125

Gly Glu Glu Ile Gly Leu Pro Asp Ala Glu Val Thr Ile Arg Thr Gly
130 135 140

Gln Ala Leu Leu Gly Asp Ala Arg Trp Leu Ala Ser Leu Val Pro Asn
145 150 155 160

Ser Ala Arg Gly Ala Thr Leu Arg Arg Leu Gly Ile Thr Asp Val Ala
165 170 175

Asp Leu Arg Ser Ser Arg Glu Val Ala Arg Arg Gly Pro Gly Arg Val
180 185 190

Pro Asp Gly Ile Asp Val His Leu Leu Pro Phe Pro Asp Leu Ala Asp
195 200 205

Asp Asp Ala Asp Asp Ser Ala Pro His Glu Thr Ala Phe Lys Arg Leu
210 215 220

Leu Thr Asn Asp Gly Ser Asn Gly Glu Ser Gly Glu Ser Ser Gln Ser
225 230 235 240

Ile Asn Asp Ala Ala Thr Arg Tyr Met Thr Asp Glu Tyr Arg Gln Phe
245 250 255

Pro Thr Arg Asn Gly Ala Gln Arg Ala Leu His Arg Val Val Thr Leu
260 265 270

Leu Ala Ala Gly Arg Pro Val Leu Thr His Cys Phe Ala Gly Lys Asp
275 280 285

Arg Thr Gly Phe Val Val Ala Leu Val Leu Glu Ala Val Gly Leu Asp
290 295 300

Arg Asp Val Ile Val Ala Asp
305 310 

(2) INFORMATION FOR SEQ ID NO: 189:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

CTCGTGCCGA TTCGGCACGA GCTGAGCAGC CCAAGGGGCC GTTCGGCGAA GTCATCGAGG 60 

CATTCGCCGA CGGGCTGGCC GGCAAGGGTA AGCAAATCAA CACCACGCTG AACAGCCTGT 120 

CGCAGGCGTT GAACGCCTTG AATGAGGGCC GCGGCGACTT CTTCGCGGTG GTACGCAGCC 180

TGGCGCTATT CGTCAACGCG CTACATCAGG ACGACCAACA GTTCGTCGCG TTGAACAAGA 240

ACCTTGCGGA GTTCACCGAC AGGTTGACCC ACTCCGATGC GGACCTGTCG AACGCCATCC 300

AGCAATTCGA CAGCTTGCTC GCCGTCGCGC GCCCGTTCTT CGCCAAGAAC CGCGAGGTGC 360 

TGACGCATGA CGTCAATAAT CTCGCGACCG TGACCACCAC GTTGCTGCAG CCCGATCCGT 420 

TGGATGGGTT GGAGACCGTC CTGCACATCT TCCCGACGCT GGCGGCGAAC ATTAACCAGC 480 

TTTACCATCC GACACACGGT GGCGTGGTGT CGCTTTCCGC GTTCACGAAT TTCGCCAACC 540

CGATGGAGTT CATCTGCAGC TCGATTCAGG CGGGTAGCCG GCTCGGTTAT CAAGAGTCGG 600

CCGAACTCTG TGCGCAGTAT CTGGCGCCAG TCCTCGATGC GATCAAGTTC AACTACTTTC 660

CGTTCGGCCT GAACGTGGCC AGCACCGCCT CGACACTGCC TAAAGAGATC GCGTACTCCG 720

AGCCCCGCTT GCAGCCGCCC AACGGGTACA AGGACACCAC GGTGCCCGGC ATCTGGGTGC 780

CGGATACGCC GTTGTCACAC CGCAACACGC AGCCCGGTTG GGTGGTGGCA CCCGGGATGC 840

AAGGGGTTCA GGTGGGACCG ATCACGCAGG GTTTGCTGAC GCCGGAGTCC CTGGCCGAAC 900 

TCATGGGTGG TCCCGATATC GCCCCTCCGT CGTCAGGGCT GCAAACCCCG CCCGGACCCC 960 

CGAATGCGTA CGACGAGTAC CCCGTGCTGC CGCCGATCGG TTTACAGGCC CCACAGGTGC 1020 

CGATACCACC GCCGCCTCCT GGGCCCGACG TAATCCCGGG TCCGGTGCCA CCGGTCTTGG 1080 

CGGCGATCGT GTTCCCAAGA GATCGCCCGG CAGCGTCGGA AAACTTCGAC TACATGGGCC 1140 

TCTTGTTGCT GTCGCCGGGC CTGGCGACCT TCCTGTTCGG GGTGTCATCT AGCCCCGCCC 1200 

GTGGAACGAT GGCCGATCGG CACGTGTTGA TACCGGCGAT CACCGGCCTG GCGTTGATCG 1260 

CGGCATTCGT CGCACATTCG TGGTACCGCA CAGAACATCC GCTCATAGAC ATGCGCTTGT 1320

TCCAGAACCG AGCGGTCGCG CAGGCCAACA TGACGATGAC GGTGCTCTCC CTCGGGCTGT 1380 

TTGGCTCCTT CTTGCTGCTC CCGAGCTACC TCCAGCAAGT GTTGCACCAA TCACCGATGC 1440 

AATCGGGGGT GCATATCATC CCACAGGGCC TCGGTGCCAT GCTGGCGATG CCGATCGCCG 1500 

GAGCGATGAT GGACCGACGG GGACCGGCCA AGATCGTGCT GGTTGGGATC ATGCTGATCG 1560

CTGCGGGGTT GGGCACCTTC GCCTTTGGTG TCGCGCGGCA AGCGGACTAC TTACCCATTC 1620

TGCCGACCGG GCTGGCAATC ATGGGCATGG GCATGGGCTG CTCCATGATG CCACTGTCCG 1680

GGGCGGCAGT GCAGACCCTG GCCCCACATC AGATCGCTCG CGGTTCGACG CTGATCAGCG 1740

TCAACCAGCA GGTGGGCGGT TCGATAGGGA CCGCACTGAT GTCGGTGCTG CTCACCTACC 1800

AGTTCAATCA CAGCGAAATC ATCGCTACTG CAAAGAAAGT CGCACTGACC CCAGAGAGTG 1860

GCGCCGGGCG GGGGGCGGCG GTTGACCCTT CCTCGCTACC GCGCCAAACC AACTTCGCGG 1920 

CCCAACTGCT GCATGACCTT TCGCACGCCT ACGCGGTGGT ATTCGTGATA GCGACCGCGC 1980 

TAGTGGTCTC GACGCTGATC CCCGCGGCAT TCCTGCCGAA ACAGCAGGCT AGTCATCGAA 2040

GAGCACCGTT GCTATCCGCA TGACGTCTGC TT 2072 

(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1923 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

TCACCCCGGA GAAGTCGTTC GTCGACGACC TGGACATCGA CTCGCTGTCG ATGGTCGAGA 60

TCGCCGTGCA GACCGAGGAC AAGTACGGCG TCAAGATCCC CGACGAGGAC CTCGCCGGTC 120

TGCGTACCGT CGGTGACGTT GTCGCCTACA TCCAGAAGCT CGAGGAAGAA AACCCGGAGG 180

CGGCTCAGGC GTTGCGCGCG AAGATTGAGT CGGAGAACCC CGATGCGGCA CGAGCAGATC 240

GGTGCGTTTC ACCCACATCG CAAGCTCGAG ACGCCCGTCG TCCTCTTGCA CGCTCAGCCA 300

GGTTGGCGTG TCGCCGCCTT CCAGCAAGTG TTCCCACCAC ACGAAGGGAC CCTCGCGAAA 360 

GGTGACTGAT CCGCGGACCA CATAGTCGAT GCCACCGTGG CTGACAATTG CGCCGGGTCC 420 

GAGTTGGCGG GGGCCGAATT GCGGCATTGC GTCGAAGGCC AGCGGATCCC GGCGCCCGCC 480

CGGCGTGGCT GGTGTTTTGG GCCGCCGGAT GGCCACGACG AGAACGACGA TGGCGGCGAT 540

GAACAGCGCC ACGGCAATCA CGACCAGCAG ATTTCCCACG CATACCCTCT CGTACCGCTG 600 

CGCCGCGGTT GGTCGATCGG TCGCATATCG ATGGCGCCGT TTAACGTAAC AGCTTTCGCG 660 

GGACCGGGGG TCACAACGGG CGAGTTGTCC GGCCGGGAAC CCGGCAGGTC TCGGCCGCGG 720

TCACCCCAGC TCACTGGTGC ACCATCCGGG TGTCGGTGAG CGTGCAACTC AAACACACTC 780

AACGGCAACG GTTTCTCAGG TCACCAGCTC AACCTCGACC CGCAATCGCT CGTACGTTTC 840

GACCGCGCGC AGGTCGCGAG TCAGCAGCTT TGCGCCGGCA GCTTTCGCCG TGAAGCCGAC 900

CAGGGCATCG TAGGTTGCGC CACCGGTGAC ATCGTGCTCG GCGAGGTGGT CGGTCAAGCC 960

GCGATATGAG CAGGCATCCA GTGCCAGGTA GTTGCTGGAG GTGATGTCCG CCAAGTAGGC 1020

GTGGACGGCA ACAGGGGCAA TACGATGCGG CGGTGGTAGC CGGGTCAAGA CCGAATAGGT 1080

TTCCACAGCC GCGTGCGCGA TCAGATGGAC GCCACGGTTG AGCGCGCGCA CGGCGGCCTC 1140

GTGCCCTTCG TGCCAGGTCG CGAATCCGGC AACCAGCACG CTGGTGTCTG GTGCGATCAC 1200

CGCCGTGTGC GATCGAGCGT TTCCCGAACG ATTTCGTCGG TCAACGGGGG CAGGGGACGT 1260

TCTGGCCGTG CGACGAGAAC CGAGCCTTCC CGAACGAGTT CGACACCGGT CGGGGCCGGC 1320

TCAATCTCGA TGCGCCCATC GCGCTCGGTG ATCTCCACCT GGTCGTTCCC GCGCAAGCCA 1380

AGGCGCTCGC GAATCCGCTT GGGAATCACC AGACGTCCTG CGACATCGAT GGTTGTTCGC 1440

ATGGTAGGAA ATTTACCATC GCACGTTCCA TAGGCGTGTC CTGCGCGGGA TGTCGGGACG 1500

ATCCGCTAGC GTATCGAACG ATTGTTTCGG AAATGGCTGA GGGAGCGTGC GGTGCGGGTG 1560

ATGGGTGTCG ATCCCGGGTT GACCCGATGC GGGCTGTCGC TCATCGAGAG TGGGCGTGGT 1620

CGGCAGCTCA CCGCGCTGGA TGTCGACGTG GTGCGCACAC CGTCGGATGC GGCCTTGGCG 1680

CAGCGCCTGT TGGCCATCAG CGATGCCGTC GAGCACTGGC TGGACACCCA TCATCCGGAG 1740

GTGGTGGCTA TCGAACGGGT GTTCTCTCAG CTCAACGTGA CCACGGTGAT GGGCACCGCG 1800

CAGGCCGGCG GCGTGATCGC CCTGGCGGCG GCCAAACGTG GTGTCGACGT GCATTTCCAT 1860 

ACCCCCAGCG AGGTCAAGGC GGCGGTCACT GGCAACGGTT CCGCAGACAA GGCTCAGGTC 1920 

ACC 1923 
(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1055 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

CTGGCGTGCC AGTGTCACCG GCGATATGAC GTCGGCATTC AATTTCGCGG CCCCGCCGGA 60 

CCCGTCGCCA CCCAATCTGG ACCACCCGGT CCGTCAATTG CCGAAGGTCG CCAAGTGCGT 120 

GCCCAATGTG GTGCTGGGTT TCTTGAACGA AGGCCTGCCG TATCGGGTGC CCTACCCCCA 180 

AACAACGCCA GTCCAGGAAT CCGGTCCCGC GCGGCCGATT CCCAGCGGCA TCTGCTAGCC 240

GGGGATGGTT CAGACGTAAC GGTTGGCTAG GTCGAAACCC GCGCCAGGGC CGCTGGACGG 300

GCTCATGGCA GCGAAATTAG AAAACCCGGG ATATTGTCCG CGGATTGTCA TACGATGCTG 360

AGTGCTTGGT GGTTCGTGTT TAGCCATTGA GTGTGGATGT GTTGAGACCC TGGCCTGGAA 420

GGGGACAACG TGCTTTTGCC TCTTGGTCCG CCTTTGCCGC CCGACGCGGT GGTGGCGAAA 480

CGGGCTGAGT CGGGAATGCT CGGCGGGTTG TCGGTTCCGC TCAGCTGGGG AGTGGCTGTG 540

CCACCCGATG ATTATGACCA CTGGGCGCCT GCGCCGGAGG ACGGCGCCGA TGTCGATGTC 600 

CAGGCGGCCG AAGGGGCGGA CGCAGAGGCC GCGGCCATGG ACGAGTGGGA TGAGTGGCAG 660 

GCGTGGAACG AGTGGGTGGC GGAGAACGCT GAACCCCGCT TTGAGGTGCC ACGGAGTAGC 720 

AGCAGCGTGA TTCCGCATTC TCCGGCGGCC GGCTAGGAGA GGGGGCGCAG ACTGTCGTTA 780

TTTGACCAGT GATCGGCGGT CTCGGTGTTC CCGCGGCCGG CTATGACAAC AGTCAATGTG 840

CATGACAAGT TACAGGTATT AGGTCCAGGT TCAACAAGGA GACAGGCAAC ATGGCAACAC 900

GTTTTATGAC GGATCCGCAC GCGATGCGGG ACATGGCGGG CCGTTTTGAG GTGCACGCCC 960

AGACGGTGGA GGACGAGGCT CGCCGGATGT GGGCGTCCGC GCAAAACATC TCGGGNGCGG 1020

GCTGGAGTGG CATGGCCGAG GCGACCTCGC TAGAC 1055

(2) INFORMATION FOR SEQ ID NO: 192:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:192: 

CCGCCTCGTT GTTGGCATAC TCCGCCGCGG CCGCCTCGAC CGCACTGGCC GTGGCGTGTG 60 

TCCGGGCTGA CCACCGGGAT CGCCGAACCA TCCGAGATCA CCTCGCAATG ATCCACCTCG 120

CGCAGCTGGT CACCCAGCCA CCGGGCGGTG TGCGACAGCG CCTGCATCAC CTTGGTATAG 180

CCGTCGCGCC CCAGCCGCAG GAAGTTGTAG TACTGGCCCA CCACCTGGTT ACCGGGACGG 240

GAGAAGTTCA GGGTGAAGGT CGGCATGTCG CCGCCGAGGT AGTTGACCCG GAAAACCAGA 300

TCCTCCGGCA GGTGCTCGGG CCCGCGCCAC ACGACAAACC CGACGCCGGG ATAGGTCAG 359 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 350 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

AACGGGCCCG TGGGCACCGC TCCTCTAAGG GCTCTCGTTG GTCGCATGAA GTGCTGGAAG 60 

GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC TAGTCCTAGT CCGAGTCGCC 120 

CGCAAAGTTC CTCGAATAAC TCCGTACCCG GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC 180 

TGCGCGAACC ACTTGAGGTT CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC 240 

TGATCGGTTC GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 300 

GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 350 
(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 679 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:

Glu Gln Pro Lys Gly Pro Phe Gly Glu Val Ile Glu Ala Phe Ala Asp
1 5 10 15

Gly Leu Ala Gly Lys Gly Lys Gln Ile Asn Thr Thr Leu Asn Ser Leu
20 25 30

Ser Gln Ala Leu Asn Ala Leu Asn Glu Gly Arg Gly Asp Phe Phe Ala
35 40 45

Val Val Arg Ser Leu Ala Leu Phe Val Asn Ala Leu His Gln Asp Asp
50 55 60

Gln Gln Phe Val Ala Leu Asn Lys Asn Leu Ala Glu Phe Thr Asp Arg
65 70 75 80

Leu Thr His Ser Asp Ala Asp Leu Ser Asn Ala Ile Gln Gln Phe Asp
85 90 95

Ser Leu Leu Ala Val Ala Arg Pro Phe Phe Ala Lys Asn Arg Glu Val
100 105 110

Leu Thr His Asp Val Asn Asn Leu Ala Thr Val Thr Thr Thr Leu Leu
115 120 125

Gln Pro Asp Pro Leu Asp Gly Leu Glu Thr Val Leu His Ile Phe Pro
130 135 140

Thr Leu Ala Ala Asn Ile Asn Gln Leu Tyr His Pro Thr His Gly Gly
145 150 155 160 

Val Val Ser Leu Ser Ala Phe Thr Asn Phe Ala Asn Pro Met Glu Phe 
165 170 175 

Ile Cys Ser Ser Ile Gln Ala Gly Ser Arg Leu Gly Tyr Gln Glu Ser
180 185 190

Ala Glu Leu Cys Ala Gln Tyr Leu Ala Pro Val Leu Asp Ala Ile Lys
195 200 205

Phe Asn Tyr Phe Pro Phe Gly Leu Asn Val Ala Ser Thr Ala Ser Thr
210 215 220

Leu Pro Lys Glu Ile Ala Tyr Ser Glu Pro Arg Leu Gln Pro Pro Asn
225 230 235 240

Gly Tyr Lys Asp Thr Thr Val Pro Gly Ile Trp Val Pro Asp Thr Pro
245 250 255

Leu Ser His Arg Asn Thr Gln Pro Gly Trp Val Val Ala Pro Gly Met
260 265 270

Gln Gly Val Gln Val Gly Pro Ile Thr Gln Gly Leu Leu Thr Pro Glu
275 280 285

Ser Leu Ala Glu Leu Met Gly Gly Pro Asp Ile Ala Pro Pro Ser Ser
290 295 300

Gly Leu Gln Thr Pro Pro Gly Pro Pro Asn Ala Tyr Asp Glu Tyr Pro
305 310 315 320

Val Leu Pro Pro Ile Gly Leu Gln Ala Pro Gln Val Pro Ile Pro Pro
325 330 335

Pro Pro Pro Gly Pro Asp Val Ile Pro Gly Pro Val Pro Pro Val Leu
340 345 350

Ala Ala Ile Val Phe Pro Arg Asp Arg Pro Ala Ala Ser Glu Asn Phe
355 360 365

Asp Tyr Met Gly Leu Leu Leu Leu Ser Pro Gly Leu Ala Thr Phe Leu
370 375 380

Phe Gly Val Ser Ser Ser Pro Ala Arg Gly Thr Met Ala Asp Arg His
385 390 395 400

Val Leu Ile Pro Ala Ile Thr Gly Leu Ala Leu Ile Ala Ala Phe Val
405 410 415

Ala His Ser Trp Tyr Arg Thr Glu His Pro Leu Ile Asp Met Arg Leu
420 425 430



Phe Gln Asn Arg Ala Val Ala Gln Ala Asn Met Thr Met Thr Val Leu
435 440 445

Ser Leu Gly Leu Phe Gly Ser Phe Leu Leu Leu Pro Ser Tyr Leu Gln
450 455 460

Gln Val Leu His Gln Ser Pro Met Gln Ser Gly Val His Ile Ile Pro
465 470 475 480

Gln Gly Leu Gly Ala Met Leu Ala Met Pro Ile Ala Gly Ala Met Met
485 490 495

Asp Arg Arg Gly Pro Ala Lys Ile Val Leu Val Gly Ile Met Leu Ile
500 505 510

Ala Ala Gly Leu Gly Thr Phe Ala Phe Gly Val Ala Arg Gln Ala Asp
515 520 525

Tyr Leu Pro Ile Leu Pro Thr Gly Leu Ala Ile Met Gly Met Gly Met
530 535 540

Gly Cys Ser Met Met Pro Leu Ser Gly Ala Ala Val Gln Thr Leu Ala
545 550 555 560

Pro His Gln Ile Ala Arg Gly Ser Thr Leu Ile Ser Val Asn Gln Gln
565 570 575

Val Gly Gly Ser Ile Gly Thr Ala Leu Met Ser Val Leu Leu Thr Tyr
580 585 590

Gln Phe Asn His Ser Glu Ile Ile Ala Thr Ala Lys Lys Val Ala Leu
595 600 605

Thr Pro Glu Ser Gly Ala Gly Arg Gly Ala Ala Val Asp Pro Ser Ser
610 615 620

Leu Pro Arg Gln Thr Asn Phe Ala Ala Gln Leu Leu His Asp Leu Ser
625 630 635 640

His Ala Tyr Ala Val Val Phe Val Ile Ala Thr Ala Leu Val Val Ser
645 650 655

Thr Leu Ile Pro Ala Ala Phe Leu Pro Lys Gln Gln Ala Ser His Arg
660 665 670 

Arg Ala Pro Leu Leu Ser Ala 
675 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:

Thr Pro Glu Lys Ser Phe Val Asp Asp Leu Asp Ile Asp Ser Leu Ser
1 5 10 15

Met Val Glu Ile Ala Val Gln Thr Glu Asp Lys Tyr Gly Val Lys Ile
20 25 30

Pro Asp Glu Asp Leu Ala Gly Leu Arg Thr Val Gly Asp Val Val Ala
35 40 45

Tyr Ile Gln Lys Leu Glu Glu Glu Asn Pro Glu Ala Ala Gln Ala Leu
50 55 60

Arg Ala Lys Ile Glu Ser Glu Asn Pro Asp Ala Ala Arg Ala Asp Arg
65 70 75 80

Cys Val Ser Pro Thr Ser Gln Ala Arg Asp Ala Arg Arg Pro Leu Ala
85 90 95

Arg Ser Ala Arg Leu Ala Cys Arg Arg Leu Pro Ala Ser Val Pro Thr
100 105 110

Thr Arg Arg Asp Pro Arg Glu Arg
115 120

(2) INFORMATION FOR SEQ ID NO:196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 

Leu Ala Cys Gln Cys His Arg Arg Tyr Asp Val Gly Ile Gln Phe Arg
1 5 10 15

Gly Pro Ala Gly Pro Val Ala Thr Gln Ser Gly Pro Pro Gly Pro Ser
20 25 30

Ile Ala Glu Gly Arg Gln Val Arg Ala Gln Cys Gly Ala Gly Phe Leu
35 40 45

Glu Arg Arg Pro Ala Val Ser Gly Ala Leu Pro Pro Asn Asn Ala Ser
50 55 60

Pro Gly Ile Arg Ser Arg Ala Ala Asp Ser Gln Arg His Leu Leu Ala
65 70 75 80

Gly Asp Gly Ser Asp Val Thr Val Gly
85

(2) INFORMATION FOR SEQ ID NO: 197:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 119 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:

Ala Ser Leu Leu Ala Tyr Ser Ala Ala Ala Ala Ser Thr Ala Leu Ala
1 5 10 15

Val Ala Cys Val Arg Ala Asp His Arg Asp Arg Arg Thr Ile Arg Asp
20 25 30

His Leu Ala Met Ile His Leu Ala Gln Leu Val Thr Gln Pro Pro Gly
35 40 45

Gly Val Arg Gln Arg Leu His His Leu Gly Ile Ala Val Ala Pro Gln
50 55 60

Pro Gln Glu Val Val Val Leu Ala His His Leu Val Thr Gly Thr Gly
65 70 75 80

Glu Val Gln Gly Glu Gly Arg His Val Ala Ala Glu Val Val Asp Pro
85 90 95

Glu Asn Gln Ile Leu Arg Gln Val Leu Gly Pro Ala Pro His Asp Lys
100 105 110

Pro Asp Ala Gly Ile Gly Gln
115

(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 116 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

Arg Ala Arg Gly His Arg Ser Ser Lys Gly Ser Arg Trp Ser His Glu
1 5 10 15

Val Leu Glu Gly Cys Ile Leu Ala Asp Ser Arg Gln Ser Lys Thr Ala
20 25 30

Ala Ser Pro Ser Pro Ser Arg Pro Gln Ser Ser Ser Asn Asn Ser Val
35 40 45

Pro Gly Ala Pro Asn Arg Val Ser Phe Ala Lys Leu Arg Glu Pro Leu
50 55 60

Glu Val Pro Gly Leu Leu Asp Val Gln Thr Asp Ser Phe Glu Trp Leu
65 70 75 80

Ile Gly Ser Pro Arg Trp Arg Glu Ser Ala Ala Glu Arg Gly Asp Val
85 90 95

Asn Pro Val Gly Gly Leu Glu Glu Val Leu Tyr Glu Leu Ser Pro Ile
100 105 110

Glu Asp Phe Ser
115

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

TGCTACGCAG CAATCGCTTT GGTGACAGAT GTGGATGCCG GCGTCGCTGC TGGCGATGGC 60

GTGAAAGCCG CCGACGTGTT CGCCGCATTC GGGGAGAACA TCGAACTGCT CAAAAGGCTG 120

GTGCGGGCCG CCATCGATCG GGTCGCCGAC GAGCGCACGT GCACGCACTG TCAACACCAC 180

GCCGGTGTTC CGTTGCCGTT CGAGCTGCCA TGAGGGTGCT GCTGACCGGC GCGGCCGGCT 240

TCATCGGGTC GCGCGTGGAT GCGGCGTTAC GGGCTGCGGG TCACGACGTG GTGGGCGTCG 300

ACGCGCTGCT GCCCGCCGCG CACGGGCCAA ACCCGGTGCT GCCACCGGGC TGCCAGCGGG 360 

TCGACGTGCG CGACGCCAGC GCGCTGGCCC CGTTGTTGGC CGGTGTCGAT CTGGTGTGTC 420 

ACCAGGCCGC CATGGTGGGT GCCGGCGTCA ACGCCGCCGA CGCACCCGCC TATGGCGGCC 480 

ACAACGATTT CGCCACCACG GTGCTGCTGG CGCAGATGTT CGCCGCCGGG GTCCGCCGTT 540 

TGGTGCTGGC GTCGTCGATG GTGGTTTACG GGCAGGGGCG CTATGACTGT CCCCAGCATG 600 

GACCGGTCGA CCCGCTGCCG CGGCGGCGAG CCGACCTGGA CAATGGGGTC TTCGAGCACC 660 

GTTGCCCGGG GTGCGGCGAG CCAGTCATCT GGCAATTGGT CGACGAAGAT GCCCCGTTGC 720 

GCCCGCGCAG CCTGTACGCG GCAGCAAGAC CGCGCAGGAG CACTACGCGC TGGCGTGGTC 780

GGAAACGAAT GGCGGTTCCG TGGTGGCGTT G 811 
(2) INFORMATION FOR SEQ ID NO: 200:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 
GTCCCGCGAT GTGGCCGAGC ATGACTTTCG GCAACACCGG CGTAGTAGTC GAAGATATCG 
GACTTTGTGG TCCCGGTGGC GGGATAGAGC ACCTGTCGGC GTTGGTCAGC GTCACCCGTT 
GCTCGGACGC CGAACCCATG CTTTCAACGT AGCCTGTCGG TCACACAAGT CGCGAGCGTA 
ACGTCACGGT CAAATATCGC GTGGAATTTC GCCGTGACGT TCCGCTCGCG GACAATCAAG 
GCATACTCAC TTACATGCGA GCCATTTGGA CGGGTTCGAT CGCCTTCGGG CTGGTGAACG
TGCCGGTCAA GGTGTACAGC GCTACCGCAG ACCACGACAT CAGGTTCCAC CAGGTGCACG 
CCAAGGACAA CGGACGCATC CGGTACAAGC GCGTCTGCGA GGCGTGTGGC GAGGTGGTCG 
ACTACCGCGA TCTTGCCCGG GCCTACGAGT CCGGCGACGG CCAAATGGTG GCGATCACCG 
ACGACGACAT CGCCAGCTTG CCTGAAGAAC GCAGCCGGGA GATCGAGGTG TTGGAGTTCG 
TCCCCGCCGC CGACGTGGAC CCGATGATGT TCGACCGCAG CTACTTTTTG GAGCCTGATT
CGAAGTCGTC GAAATCGTAT GTGCTGCTGG CTAAGACACT CGCCGAGACC GACCGGATGG
CGATCGTGGA TCGCCCCACC GGCCGTGAAT GCAGGAAAAA TAAGAGCCGC TATCCACAAT
TCGGCGTCGA GCTCGGCTAC CACAAACGGT AGAACGATCG AGACATTCCC GAGCTGAAGT
GCGGCGCTAT AGAAGCCGCT CTGCGCGATT ATCAAACGCA AAATACGCTT ACTCATGCCA 
TCGGCGCTGC TCACCCGATG CGACGTTTTT GCCACGCTCC ACCGCCTGCC GCGCGACCTC
AAGTGGGCAT GCATCCCACC CGTTCCCGGA AACCGGTTCC GGCGGGTCGG CTCATCGCTT 
CATCCT 

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

CCGCACCGCC GGCAATACCG CCAGCGCCAC CGTTACCGCC GTTTGCGCCG TTGCCCCCGT 

TGCCGCCCGT CCCGCCGGCC CCGCCGATGG AGTTCTCATC GCCAAAAGTA CTGGCGTTGC

CACCGGAGCC GCCGTTGCCG CCGTCACCGC CAGCCCCGCC GACTCCACCG GCCCCACCGA

CTCCGCCGCT GCCACCGTTG CCGCCGTTGC CGATCAACAT GCCGCTGGCG CCACCCTTGC 240

CACCCACGCC ACCGGCTCCG CCCACCCCGC CGACACCAAG CGAGCTGCCG CCGGAGCCAC 300 

CATCACCACC TACGCCACCG ACCGCCCAGA CACCAGCGAC CGGGTCTTCG TGAAACGTCG 360 

CGGTGCCACC ACCGCCGCCG TTACCGCCAA CCCCACCGGC AACGCCGGCG CCGCCATCCC 420 

CGCCGGCCCC GGCGTTGCCG CCGTTGCCGC CGTTGCCGAA CAACAACCCG CCGGCGCCGC 480

CGTTGCCGCC CGCGCCGCCG GTCCCGCCGG CGCCGCCGAC GCCAAGGCCG CTGCCGCCCT 540 

TGCCGCCATC ACCACCCTTG CCGCCGACCA CATCGGGTTC TGCCTCGGGG TCTGGGCTGT 600 

CAAACCTCGC GATGCCAGCG TTGCCGCCGC TTCCCCCGGG CCCCCCCGTG GCGCCGTCAC 660 

CACCGATACC ACCCGCGCCA CCGGCGCCAC CGTTGCCGCC ATCACCGAAT AGCAACCCGC 720

CGGCGCCACC ATTGCCGCCA GCTCCCCCTG CGCCACCGTC GGCGCCGGAG GCGGCACTGG 780

CAGCCCCGTT ACCACCGAAA CCGCCGCTAC CACCGGTAGA GGTGGCAGTG GCGATGTGTA 840

CGAAAGCGCC GCCTCCGGCG CCGCCGCTAC CACCCCCACT GCCGGCGGCT ACACCGTCGG 900

ACCCGTTGCC ACCATCACCG CCAAAGGCGC TCGCAATGTC GCCCTGCGCG ACTCCGCCGT 960

CGCCGCCGTT GCCGCCGCCG CCACCGGCAG CGGCGGTACC GCCGTCACCA CCGGCACCGC 1020

CGGTGGCCTT GCCCGAGCCT GCCGTCGCGG TGGCACCGTC GCCGCCGGTG CCACCGGTCG 1080

GCGTGCCGGC AGTGCCATGG CCGCCCGTGC CGCCGTCGCC GCCGGTTTGA TCACCGATGC 1140

CGGACACATC TGCCGGGCTG TCCCCGGTGC TGGCCGCGGG GCCGGGCGTG GGATTGACCC 1200

CGTTTGCCCC GGCGAGGCCG GCGCCGCCGG TACCACCGGC GCCGCCATGG CCGAACAGCC 1260

CGGCGTTGCC GCCGTTACCG CCCGCACCCC CGATGCCTGC GGCCACGCTG GTGCCGCCGA 1320

CACCGCCGTT GCCGCCGTTG CCCCACAACC ACCCCCCGTT CCCACCGGCA CCGCCGGCCG 1380

CGCCGGTACC ACCGGCCCCG CCGTTGCCGC CGTTGCCGAT CAACCCGGCC GCGCCTCCGC 1440

TGCCGCCGGT TTGACCGAAC CCGCCAGCCG CGCCGTTGCC ACCGTTGCCA AACAGCAACC 1500

CGCCGGCCGC GCCAGGCTGC CCGGGTGCCG TCCCGTCGGC GCCGTTTCCG ATCAACGGGC 1560

GCCCCAAAAG CGCCTCGGTG GGCGCATTCA CCGCACCCAG CAGACTCCGC TCAACAGCGG 1620

CTTCAGTGCT GGCATACCGA CCCGCGGCCG CAGTCAACGC CTGCACAAAC TGCTCGTGAA 1680

ACGCTGCCAC CTGTACGCTG AGCGCCTGAT ACTGCCGAGC ATGGGCCCCG AACAACCCCG 1740

CAATCGCCGC CGACACTTCA TCGGCAGCCG CAGCCACCAC TTCCGTCGTC GGGATCGCCG 1800

CGGCCGCATT AGCCGCGCTC ACCTGCGAAC CAATAGTCGA TAAATCCAAA GCCGCAGTTG 1860

CCAGCAGCTG CGGCGTCGCG ATCACCAAGG ACACCTCGCA CCTCCGGATA CCCCATATCG 1920

CCGCACCGTG TCCCCAGCGG CCACGTGACC TTTGGTCGCT GGCTGGCGGC CCTGACTATG 1980 

GCCGCGACGG CCCTCGTTCT GATTCGCCCC GGCGCGCAGC TTGTTGCGCG AGTTGAAGAC 2040 

GGGAGGACAG GCCGAGCTTG GTGTAGACGT GGGTCAAGTG GGAATGCACG GTCCGCGGCG 2100 

AGATGAATAG GCGGACGCCG ATCTCCTTGT TGCTGAGTCC CTCACCGACC AGTAGAGCCA 2160 

CCTCAAGCTC TGTCGGTGTC AACGCGCCCC AGCCACTTGT CGGGCGTTTC CGTGCACCGC 2220 

GGCCTCGTTG CGCGTACGCG ATCGCCTCAT CGATCGATAA CGCAGTTCCT TCGGCCCAGG 2280

CATGGTCGAA GTCGCTGTCA GCCATGGATT TTCGAAGGGT GGCTAGGGAC GAGTTAGAGC 2340

CCGCCTGGTA GATCCCGAAG CGGACCG 2367 

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 376 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202:

Gln Pro Ala Gly Ala Thr Ile Ala Ala Ser Ser Pro Cys Ala Thr Val
1 5 10 15

Gly Ala Gly Gly Gly Thr Gly Ser Pro Val Thr Thr Glu Thr Ala Ala
20 25 30

Thr Thr Gly Arg Gly Gly Ser Gly Asp Val Tyr Glu Ser Ala Ala Ser 
35 40 45 

Gly Ala Ala Ala Thr Thr Pro Thr Ala Gly Gly Tyr Thr Val Gly Pro 
50 55 60 

Val Ala Thr Ile Thr Ala Lys Gly Ala Arg Asn Val Ala Leu Arg Asp
65 70 75 80 

Ser Ala Val Ala Ala Val Ala Ala Ala Ala Thr Gly Ser Gly Gly Thr 
85 90 95 

Ala Val Thr Thr Gly Thr Ala Gly Gly Leu Ala Arg Ala Cys Arg Arg 
100 105 110 

Gly Gly Thr Val Ala Ala Gly Ala Thr Gly Arg Arg Ala Gly Ser Ala 
115 120 125 

Met Ala Ala Arg Ala Ala Val Ala Ala Gly Leu Ile Thr Asp Ala Gly
130 135 140

His Ile Cys Arg Ala Val Pro Gly Ala Gly Arg Gly Ala Gly Arg Gly
145 150 155 160

Ile Asp Pro Val Cys Pro Gly Glu Ala Gly Ala Ala Gly Thr Thr Gly
165 170 175

Ala Ala Met Ala Glu Gln Pro Gly Val Ala Ala Val Thr Ala Arg Thr
180 185 190

Pro Asp Ala Cys Gly His Ala Gly Ala Ala Asp Thr Ala Val Ala Ala
195 200 205

Val Ala Pro Gln Pro Pro Pro Val Pro Thr Gly Thr Ala Gly Arg Ala
210 215 220

Gly Thr Thr Gly Pro Ala Val Ala Ala Val Ala Asp Gln Pro Gly Arg
225 230 235 240

Ala Ser Ala Ala Ala Gly Leu Thr Glu Pro Ala Ser Arg Ala Val Ala
245 250 255

Thr Val Ala Lys Gln Gln Pro Ala Gly Arg Ala Arg Leu Pro Gly Cys
260 265 270

Arg Pro Val Gly Ala Val Ser Asp Gln Arg Ala Pro Gln Lys Arg Leu
275 280 285

Gly Gly Arg Ile His Arg Thr Gln Gln Thr Pro Leu Asn Ser Gly Phe
290 295 300

Ser Ala Gly Ile Pro Thr Arg Gly Arg Ser Gln Arg Leu His Lys Leu
305 310 315 320

Leu Val Lys Arg Cys His Leu Tyr Ala Glu Arg Leu Ile Leu Pro Ser
325 330 335

Met Gly Pro Glu Gln Pro Arg Asn Arg Arg Arg His Phe Ile Gly Ser
340 345 350

Arg Ser His His Phe Arg Arg Arg Asp Arg Arg Gly Arg Ile Ser Arg
355 360 365 

Ala His Leu Arg Thr Asn Ser Arg 
370 375 

(2) INFORMATION FOR SEQ ID NO: 203:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:203:

GGCCAAAACG CCCCGGCGAT CGCGGCCACC GAGGCCGCCT ACGACCAGAT GTGGGCCCAG 60

GACGTGGCGG CGATGTTTGG CTACCATGCC GGGGCTTCGG CGGCCGTCTC GGCGTTGACA 120

CCGTTCGGCC AGGCGCTGCC GACCGTGGCG GGCGGCGGTG CGCTGGTCAG CGCGGCCGCG 180

GCTCAGGTGA CCACGCGGGT CTTCCGCAAC CTGGGCTTGG CGAACGTCCG CGAGGGCAAC 240

GTCCGCAACG GTAATGTCCG GAACTTCAAT CTCGGCTCGG CCAACATCGG CAACGGCAAC 300

ATCGGCAGCG GCAACATCGG CAGCTCCAAC ATCGGGTTTG GCAACGTGGG TCCTGGGTTG 360

ACCGCAGCGC TGAACAACAT CGGTTTCGGC AACACCGGCA GCAACAACAT CGGGTTTGGC 420

AACACCGGCA GCAACAACAT CGGGTTCGGC AATACCGGAG ACGGCAACCG AGGTATCGGG 480

CTCACGGGTA GCGGTTTGTT GGGGTTCGGC GGCCTGAACT CGGGCACCGG CAACATCGGT 540

CTGTTCAACT CGGGCACCGG AAACGTCGGC ATCGGCAACT CGGGTACCGG GAACTGGGGC 600

ATTGGCAACT CGGGCAACAG CTACAACACC GGTTTTGGCA ACTCCGGCGA CGCCAACACG 660

GGCTTCTTCA ACTCCGGAAT AGCCAACACC GGCGTCGGCA ACGCCGGCAA CTACAACACC 720

GGTAGCTACA ACCCGGGCAA CAGCAATACC GGCGGCTTCA ACATGGGCCA GTACAACACG 780

GGCTACCTGA ACAGCGGCAA CTACAACACC GGCTTGGCAA ACTCCGGCAA TGTCAACACC 840

GGCGCCTTCA TTACTGGCAA CTTCAACAAC GGCTTCTTGT GGCGCGGCGA CCACCAAGGC 900

CTGATTTTCG GGAGCCCCGG CTTCTTCAAC TCGACCAGTG CGCCGTCGTC GGGATTCTTC 960

AACAGCGGTG CCGGTAGCGC GTCCGGCTTC CTGAACTCCG GTGCCAACAA TTCTGGCTTC 1020

TTCAACTCTT CGTCGGGGGC CATCGGTAAC TCCGGCCTGG CAAACGCGGG CGTGCTGGTA 1080

TCGGGCGTGA TCAACTCGGG CAACACCGTA TCGGGTTTGT TCAACATGAG CCTGGTGGCC 1140

ATCACAACGC CGGCCTTGAT CTCGGGCTTC TTCAACACCG GAAGCAACAT GTCGGGATTT 1200

TTCGGTGGCC CACCGGTCTT CAATCTCGGC CTGGCAAACC GGGGCGTCGT GAACATTCTC 1260

GGCAACGCCA ACATCGGCAA TTACAACATT CTCGGCAGCG GAAACGTCGG TGACTTCAAC 1320

ATCCTTGGCA GCGGCAACCT CGGCAGCCAA AACATCTTGG GCAGCGGCAA CGTCGGCAGC 1380

TTCAATATCG GCAGTGGAAA CATCGGAGTA TTCAATGTCG GTTCCGGAAG CCTGGGAAAC 1440

TACAACATCG GATCCGGAAA CCTCGGGATC TACAACATCG GTTTTGGAAA CGTCGGCGAC 1500

TACAACGTCG GCTTCGGGAA CGCGGGCGAC TTCAACCAAG GCTTTGCCAA CACCGGCAAC 1560

AACAACATCG GGTTCGCCAA CACCGGCAAC AACAACATCG GCATCGGGCT GTCCGGCGAC 1620

AACCAGCAGG GCTTCAATAT TGCTAGCGGC TGGAACTCGG GCACCGGCAA CAGCGGCCTG 1680 

TTCAATTCGG GCACCAATAA CGTTGGCATC TTCAACGCGG GCACCGGAAA CGTCGGCATC 1740 

GCAAACTCGG GCACCGGGAA CTGGGGTATC GGGAACCCGG GTACCGACAA TACCGGCATC 1800 

CTCAATGCTG GCAGCTACAA CACGGGCATC CTCAACGCCG GCGACTTCAA CACGGGCTTC 1860 

TACAACACGG GCAGCTACAA CACCGGCGGC TTCAACGTCG GTAACACCAA CACCGGCAAC 1920

TTCAACGTGG GTGACACCAA TACCGGCAGC TATAACCCGG GTGACACCAA CACCGGCTTC 1980

TTCAATCCCG GCAACGTCAA TACCGGCGCT TTCGACACGG GCGACTTCAA CAATGGCTTC 2040

TTGGTGGCGG GCGATAACCA GGGCCAGATT GCCATCGATC TCTCGGTCAC CACTCCATTC 2100

ATCCCCATAA ACGAGCAGAT GGTCATTGAC GTACACAACG TAATGACCTT CGGCGGCAAC 2160

ATGATCACGG TCACCGAGGC CTCGACCGTT TTCCCCCAAA CCTTCTATCT GAGCGGTTTG 2220

TTCTTCTTCG GCCCGGTCAA TCTCAGCGCA TCCACGCTGA CCGTTCCGAC GATCACCCTC 2280

ACCATCGGCG GACCGACGGT GACCGTCCCC ATCAGCATTG TCGGTGCTCT GGAGAGCCGC 2340

ACGATTACCT TCCTCAAGAT CGATCCGGCG CCGGGCATCG GAAATTCGAC CACCAACCCC 2400

TCGTCCGGCT TCTTCAACTC GGGCACCGGT GGCACATCTG GCTTCCAAAA CGTCGGCGGC 2460

GGCAGTTCAG GCGTCTGGAA CAGTGGTTTG AGCAGCGCGA TAGGGAATTC GGGTTTCCAG 2520

AACCTCGGCT CGCTGCAGTC AGGCTGGGCG AACCTGGGCA ACTCCGTATC GGGCTTTTTC 2580

AACACCAGTA CGGTGAACCT CTCCACGCCG GCCAATGTCT CGGGCCTGAA CAACATCGGC 2640

ACCAACCTGT CCGGCGTGTT CCGCGGTCCG ACCGGGACGA TTTTCAACGC GGGCCTTGCC 2700

AACCTGGGCC AGTTGAACAT CGGCAGCGCC TCGTGCCGAA TTCGGCACGA GTTAGATACG 2760

GTTTCAACAA TCATATCCGC GTTTTGCGGC AGTGCATCAG ACGAATCGAA CCCGGGAAGC 2820

GTAAGCGAAT AAACCGAATG GCGGCCTGTC AT 2852
(2) INFORMATION FOR SEQ ID NO:204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:204:

Gly Gln Asn Ala Pro Ala Ile Ala Ala Thr Glu Ala Ala Tyr Asp Gln
1 5 10 15

Met Trp Ala Gln Asp Val Ala Ala Met Phe Gly Tyr His Ala Gly Ala
20 25 30

Ser Ala Ala Val Ser Ala Leu Thr Pro Phe Gly Gln Ala Leu Pro Thr
35 40 45

Val Ala Gly Gly Gly Ala Leu Val Ser Ala Ala Ala Ala Gln Val Thr
50 55 60

Thr Arg Val Phe Arg Asn Leu Gly Leu Ala Asn Val Arg Glu Gly Asn
65 70 75 80

Val Arg Asn Gly Asn Val Arg Asn Phe Asn Leu Gly Ser Ala Asn Ile
85 90 95

Gly Asn Gly Asn Ile Gly Ser Gly Asn Ile Gly Ser Ser Asn Ile Gly
100 105 110

Phe Gly Asn Val Gly Pro Gly Leu Thr Ala Ala Leu Asn Asn Ile Gly
115 120 125

Phe Gly Asn Thr Gly Ser Asn Asn Ile Gly Phe Gly Asn Thr Gly Ser
130 135 140

Asn Asn Ile Gly Phe Gly Asn Thr Gly Asp Gly Asn Arg Gly Ile Gly
145 150 155 160

Leu Thr Gly Ser Gly Leu Leu Gly Phe Gly Gly Leu Asn Ser Gly Thr
165 170 175

Gly Asn Ile Gly Leu Phe Asn Ser Gly Thr Gly Asn Val Gly Ile Gly
180 185 190

Asn Ser Gly Thr Gly Asn Trp Gly Ile Gly Asn Ser Gly Asn Ser Tyr
195 200 205

Asn Thr Gly Phe Gly Asn Ser Gly Asp Ala Asn Thr Gly Phe Phe Asn
210 215 220

Ser Gly Ile Ala Asn Thr Gly Val Gly Asn Ala Gly Asn Tyr Asn Thr
225 230 235 240

Gly Ser Tyr Asn Pro Gly Asn Ser Asn Thr Gly Gly Phe Asn Met Gly
245 250 255

Gln Tyr Asn Thr Gly Tyr Leu Asn Ser Gly Asn Tyr Asn Thr Gly Leu
260 265 270

Ala Asn Ser Gly Asn Val Asn Thr Gly Ala Phe Ile Thr Gly Asn Phe
275 280 285

Asn Asn Gly Phe Leu Trp Arg Gly Asp His Gln Gly Leu Ile Phe Gly
290 295 300 



Ser Pro Gly Phe Phe Asn Ser Thr Ser Ala Pro Ser Ser Gly Phe Phe
305 310 315 320

Asn Ser Gly Ala Gly Ser Ala Ser Gly Phe Leu Asn Ser Gly Ala Asn 
325 330 335 

Asn Ser Gly Phe Phe Asn Ser Ser Ser Gly Ala Ile Gly Asn Ser Gly
340 345 350

Leu Ala Asn Ala Gly Val Leu Val Ser Gly Val Ile Asn Ser Gly Asn
355 360 365

Thr Val Ser Gly Leu Phe Asn Met Ser Leu Val Ala Ile Thr Thr Pro
370 375 380

Ala Leu Ile Ser Gly Phe Phe Asn Thr Gly Ser Asn Met Ser Gly Phe
385 390 395 400

Phe Gly Gly Pro Pro Val Phe Asn Leu Gly Leu Ala Asn Arg Gly Val
405 410 415

Val Asn Ile Leu Gly Asn Ala Asn Ile Gly Asn Tyr Asn Ile Leu Gly
420 425 430

Ser Gly Asn Val Gly Asp Phe Asn Ile Leu Gly Ser Gly Asn Leu Gly
435 440 445

Ser Gln Asn Ile Leu Gly Ser Gly Asn Val Gly Ser Phe Asn Ile Gly
450 455 460

Ser Gly Asn Ile Gly Val Phe Asn Val Gly Ser Gly Ser Leu Gly Asn
465 470 475 480

Tyr Asn Ile Gly Ser Gly Asn Leu Gly Ile Tyr Asn Ile Gly Phe Gly
485 490 495

Asn Val Gly Asp Tyr Asn Val Gly Phe Gly Asn Ala Gly Asp Phe Asn
500 505 510

Gln Gly Phe Ala Asn Thr Gly Asn Asn Asn Ile Gly Phe Ala Asn Thr
515 520 525

Gly Asn Asn Asn Ile Gly Ile Gly Leu Ser Gly Asp Asn Gln Gln Gly
530 535 540

Phe Asn Ile Ala Ser Gly Trp Asn Ser Gly Thr Gly Asn Ser Gly Leu
545 550 555 560

Phe Asn Ser Gly Thr Asn Asn Val Gly Ile Phe Asn Ala Gly Thr Gly
565 570 575

Asn Val Gly Ile Ala Asn Ser Gly Thr Gly Asn Trp Gly Ile Gly Asn
580 585 590

Pro Gly Thr Asp Asn Thr Gly Ile Leu Asn Ala Gly Ser Tyr Asn Thr
595 600 605

Gly Ile Leu Asn Ala Gly Asp Phe Asn Thr Gly Phe Tyr Asn Thr Gly
610 615 620

Ser Tyr Asn Thr Gly Gly Phe Asn Val Gly Asn Thr Asn Thr Gly Asn
625 630 635 640

Phe Asn Val Gly Asp Thr Asn Thr Gly Ser Tyr Asn Pro Gly Asp Thr 
645 650 655 

Asn Thr Gly Phe Phe Asn Pro Gly Asn Val Asn Thr Gly Ala Phe Asp 
660 665 670 

Thr Gly Asp Phe Asn Asn Gly Phe Leu Val Ala Gly Asp Asn Gln Gly
675 680 685

Gln Ile Ala Ile Asp Leu Ser Val Thr Thr Pro Phe Ile Pro Ile Asn
690 695 700

Glu Gln Met Val Ile Asp Val His Asn Val Met Thr Phe Gly Gly Asn
705 710 715 720

Met Ile Thr Val Thr Glu Ala Ser Thr Val Phe Pro Gln Thr Phe Tyr
725 730 735

Leu Ser Gly Leu Phe Phe Phe Gly Pro Val Asn Leu Ser Ala Ser Thr
740 745 750

Leu Thr Val Pro Thr Ile Thr Leu Thr Ile Gly Gly Pro Thr Val Thr
755 760 765

Val Pro Ile Ser Ile Val Gly Ala Leu Glu Ser Arg Thr Ile Thr Phe
770 775 780

Leu Lys Ile Asp Pro Ala Pro Gly Ile Gly Asn Ser Thr Thr Asn Pro
785 790 795 800

Ser Ser Gly Phe Phe Asn Ser Gly Thr Gly Gly Thr Ser Gly Phe Gln
805 810 815

Asn Val Gly Gly Gly Ser Ser Gly Val Trp Asn Ser Gly Leu Ser Ser
820 825 830

Ala Ile Gly Asn Ser Gly Phe Gln Asn Leu Gly Ser Leu Gln Ser Gly
835 840 845

Trp Ala Asn Leu Gly Asn Ser Val Ser Gly Phe Phe Asn Thr Ser Thr
850 855 860

Val Asn Leu Ser Thr Pro Ala Asn Val Ser Gly Leu Asn Asn Ile Gly
865 870 875 880

Thr Asn Leu Ser Gly Val Phe Arg Gly Pro Thr Gly Thr Ile Phe Asn
885 890 895

Ala Gly Leu Ala Asn Leu Gly Gln Leu Asn Ile Gly Ser Ala Ser Cys
900 905 910

Arg Ile Arg His Glu Leu Asp Thr Val Ser Thr Ile Ile Ser Ala Phe
915 920 925 

Cys Gly Ser Ala Ser Asp Glu Ser Asn Pro Gly Ser Val Ser Glu 
930 935 940 

(2) INFORMATION FOR SEQ ID NO: 205:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

GGATCCATAT GGGCCATCAT CATCATCATC ACGTGATCGA CATCATCGGG ACC 53

(2) INFORMATION FOR SEQ ID NO:206:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 
CCTGAATTCA GGCCTCGGTT GCGCCGGCCT CATCTTGAAC GA 42 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 
GGATCCTGCA GGCTCGAAAC CACCGAGCGG T 31 
(2) INFORMATION FOR SEQ ID NO: 208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208:
CTCTGAATTC AGCGCTGGAA ATCGTCGCGA T
(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 
GGATCCAGCG CTGAGATGAA GACCGATGCC GCT 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:
GGATATCTGC AGAATTCAGG TTTAAAGCCC ATTTGCGA 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:
CCGCATGCGA GCCACGTGCC CACAACGGCC 
(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:
CTTCATGGAA TTCTCAGGCC GGTAAGGTCC GCTGCGG 
(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7676 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 
TGGCGAATGG GACGCGCCCT GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG 
CAGCGTGACC GCTACACTTG CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC 
CTTTCTCGCC ACGTTCGCCG GCTTTCCCCG TCAAGCTCTA AATCGGGGGC TCCCTTTAGG 
GTTCCGATTT AGTGCTTTAC GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC 
ACGTAGTGGG CCATCGCCCT GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT 
CTTTAATAGT GGACTCTTGT TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC 
TTTTGATTTA TAAGGGATTT TGCCGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA 
ACAAAAATTT AACGCGAATT TTAACAAAAT ATTAACGTTT ACAATTTCAG GTGGCACTTT 
TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 
TCCGCTCATG AATTAATTCT TAGAAAAACT CATCGAGCAT CAAATGAAAC TGCAATTTAT 
TCATATCAGG ATTATCAATA CCATATTTTT GAAAAAGCCG TTTCTGTAAT GAAGGAGAAA 
ACTCACCGAG GCAGTTCCAT AGGATGGCAA GATCCTGGTA TCGGTCTGCG ATTCCGACTC 
GTCCAACATC AATACAACCT ATTAATTTCC CCTCGTCAAA AATAAGGTTA TCAAGTGAGA 
AATCACCATG AGTGACGACT GAATCCGGTG AGAATGGCAA AAGTTTATGC ATTTCTTTCC 
AGACTTGTTC AACAGGCCAG CCATTACGCT CGTCATCAAA ATCACTCGCA TCAACCAAAC 
CGTTATTCAT TCGTGATTGC GCCTGAGCGA GACGAAATAC GCGATCGCTG TTAAAAGGAC 
AATTACAAAC AGGAATCGAA TGCAACCGGC GCAGGAACAC TGCCAGCGCA TCAACAATAT 
TTTCACCTGA ATCAGGATAT TCTTCTAATA CCTGGAATGC TGTTTTCCCG GGGATCGCAG
TGGTGAGTAA CCATGCATCA TCAGGAGTAC GGATAAAATG CTTGATGGTC GGAAGAGGCA 
TAAATTCCGT CAGCCAGTTT AGTCTGACCA TCTCATCTGT AACATCATTG GCAACGCTAC 
CTTTGCCATG TTTCAGAAAC AACTCTGGCG CATCGGGCTT CCCATACAAT CGATAGATTG
TCGCACCTGA TTGCCCGACA TTATCGCGAG CCCATTTATA CCCATATAAA TCAGCATCCA
TGTTGGAATT TAATCGCGGC CTAGAGCAAG ACGTTTCCCG TTGAATATGG CTCATAACAC
CCCTTGTATT ACTGTTTATG TAAGCAGACA GTTTTATTGT TCATGACCAA AATCCCTTAA 
CGTGAGTTTT CGTTCCACTG AGCGTCAGAC CCCGTAGAAA AGATCAAAGG ATCTTCTTGA
GATCCTTTTT TTCTGCGCGT AATCTGCTGC TTGCAAACAA AAAAACCACC GCTACCAGCG 1560

GTGGTTTGTT TGCCGGATCA AGAGCTACCA ACTCTTTTTC CGAAGGTAAC TGGCTTCAGC 1620

AGAGCGCAGA TACCAAATAC TGTCCTTCTA GTGTAGCCGT AGTTAGGCCA CCACTTCAAG 1680

AACTCTGTAG CACCGCCTAC ATACCTCGCT CTGCTAATCC TGTTACCAGT GGCTGCTGCC 1740

AGTGGCGATA AGTCGTGTCT TACCGGGTTG GACTCAAGAC GATAGTTACC GGATAAGGCG 1800

CAGCGGTCGG GCTGAACGGG GGGTTCGTGC ACACAGCCCA GCTTGGAGCG AACGACCTAC 1860

ACCGAACTGA GATACCTACA GCGTGAGCTA TGAGAAAGCG CCACGCTTCC CGAAGGGAGA 1920

AAGGCGGACA GGTATCCGGT AAGCGGCAGG GTCGGAACAG GAGAGCGCAC GAGGGAGCTT 1980

CCAGGGGGAA ACGCCTGGTA TCTTTATAGT CCTGTCGGGT TTCGCCACCT CTGACTTGAG 2040

CGTCGATTTT TGTGATGCTC GTCAGGGGGG CGGAGCCTAT GGAAAAACGC CAGCAACGCG 2100 

GCCTTTTTAC GGTTCCTGGC CTTTTGCTGG CCTTTTGCTC ACATGTTCTT TCCTGCGTTA 2160 

TCCCCTGATT CTGTGGATAA CCGTATTACC GCCTTTGAGT GAGCTGATAC CGCTCGCCGC 2220 

AGCCGAACGA CCGAGCGCAG CGAGTCAGTG AGCGAGGAAG CGGAAGAGCG CCTGATGCGG 2280 

TATTTTCTCC TTACGCATCT GTGCGGTATT TCACACCGCA TATATGGTGC ACTCTCAGTA 2340 

CAATCTGCTC TGATGCCGCA TAGTTAAGCC AGTATACACT CCGCTATCGC TACGTGACTG 2400

GGTCATGGCT GCGCCCCGAC ACCCGCCAAC ACCCGCTGAC GCGCCCTGAC GGGCTTGTCT 2460

GCTCCCGGCA TCCGCTTACA GACAAGCTGT GACCGTCTCC GGGAGCTGCA TGTGTCAGAG 2520

GTTTTCACCG TCATCACCGA AACGCGCGAG GCAGCTGCGG TAAAGCTCAT CAGCGTGGTC 2580 

GTGAAGCGAT TCACAGATGT CTGCCTGTTC ATCCGCGTCC AGCTCGTTGA GTTTCTCCAG 2640 

AAGCGTTAAT GTCTGGCTTC TGATAAAGCG GGCCATGTTA AGGGCGGTTT TTTCCTGTTT 2700 

GGTCACTGAT GCCTCCGTGT AAGGGGGATT TCTGTTCATG GGGGTAATGA TACCGATGAA 2760

ACGAGAGAGG ATGCTCACGA TACGGGTTAC TGATGATGAA CATGCCCGGT TACTGGAACG 2820

TTGTGAGGGT AAACAACTGG CGGTATGGAT GCGGCGGGAC CAGAGAAAAA TCACTCAGGG 2880

TCAATGCCAG CGCTTCGTTA ATACAGATGT AGGTGTTCCA CAGGGTAGCC AGCAGCATCC 2940

TGCGATGCAG ATCCGGAACA TAATGGTGCA GGGCGCTGAC TTCCGCGTTT CCAGACTTTA 3000

CGAAACACGG AAACCGAAGA CCATTCATGT TGTTGCTCAG GTCGCAGACG TTTTGCAGCA 3060

GCAGTCGCTT CACGTTCGCT CGCGTATCGG TGATTCATTC TGCTAACCAG TAAGGCAACC 3120

CCGCCAGCCT AGCCGGGTCC TCAACGACAG GAGCACGATC ATGCGCACCC GTGGGGCCGC 3180
CATGCCGGCG ATAATGGCCT GCTTCTCGCC GAAACGTTTG GTGGCGGGAC CAGTGACGAA
GGCTTGAGCG AGGGCGTGCA AGATTCCGAA TACCGCAAGC GACAGGCCGA TCATCGTCGC 
GCTCCAGCGA AAGCGGTCCT CGCCGAAAAT GACCCAGAGC GCTGCCGGGA CCTGTCCTAC 
GAGTTGCATG ATAAAGAAGA CAGTCATAAG TGCGGCGACG ATAGTCATGC CCCGCGCCCA 
CCGGAAGGAG CTGACTGGGT TGAAGGCTCT CAAGGGCATC GGTCGAGATC CCGGTGCCTA 
ATGAGTGAGC TAACTTACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC AGTCGGGAAA 
CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG GTTTGCGTAT 

TGGGCGCCAG GGTGGTTTTT CTTTTGAGG? .^wijiOAogco. .~.-».*~-»r*™~-» ~~~~ 

CCGCCTGGCC CTGAGAGAGT TGCAGCAAGC GGTCCACGCT GGTTTGCCCC AGCAGGCGAA 
AATCCTGTTT GATGGTGGTT AACGGCGGGA TATAACATGA GCTGTCTTCG GTATCGTCGT 
ATCCCACTAC CGAGATATCC GCACCAACGC GCAGCCCGGA CTCGGTAATG GCGCGCATTG 
CGCCCAGCGC CATCTGATCG TTGGCAACCA GCATCGCAGT GGGAACGATG CCCTCATTCA 
GCATTTGCAT GGTTTGTTGA AAACCGGACA TGGCACTCCA GTCGCCTTCC CGTTCCGCTA 
TCGGCTGAAT TTGATTGCGA GTGAGATATT TATGCCAGCC AGCCAGACGC AGACGCGCCG 
AGACAGAACT TAATGGGCCC GCTAACAGCG CGATTTGCTG GTGACCCAAT GCGACCAGAT 
GCTCCACGCC CAGTCGCGTA CCGTCTTCAT GGGAGAAAAT AATACTGTTG ATGGGTGTCT 
GGTCAGAGAC ATCAAGAAAT AACGCCGGAA CATTAGTGCA GGCAGCTTCC ACAGCAATGG 
CATCCTGGTC ATCCAGCGGA TAGTTAATGA TCAGCCCACT GACGCGTTGC GCGAGAAGAT 
TGTGCACCGC CGCTTTACAG GCTTCGACGC CGCTTCGTTC TACCATCGAC ACCACCACGC 
TGGCACCCAG TTGATCGGCG CGAGATTTAA TCGCCGCGAC AATTTGCGAC GGCGCGTGCA 
GGGCCAGACT GGAGGTGGCA ACGCCAATCA GCAACGACTG TTTGCCCGCC AGTTGTTGTG 
CCACGCGGTT GGGAATGTAA TTCAGCTCCG CCATCGCCGC TTCCACTTTT TCCCGCGTTT 
TCGCAGAAAC GTGGCTGGCC TGGTTCACCA CGCGGGAAAC GGTCTGATAA GAGACACCGG 
CATACTCTGC GACATCGTAT AACGTTACTG GTTTCACATT CACCACCCTG AATTGACTCT 
CTTCCGGGCG CTATCATGCC ATACCGCGAA AGGTTTTGCG CCATTCGATG GTGTCCGGGA 
TCTCGACGCT CTCCCTTATG CGACTCCTGC ATTAGGAAGC AGCCCAGTAG TAGGTTGAGG 
CCGTTGAGCA CCGCCGCCGC AAGGAATGGT GCATGCAAGG AGATGGCGCC CAACAGTCCC

CCGGCCACGG GGCCTGCCAC CATACCCACG CCGAAACAAG CGCTCATGAG CCCGAAGTGG 4860

CGAGCCCGAT CTTCCCCATC GGTGATGTCG GCGATATAGG CGCCAGCAAC CGCACCTGTG 4920

GCGCCGGTGA TGCCGGCCAC GATGCGTCCG GCGTAGAGGA TCGAGATCTC GATCCCGCGA 4980

AATTAATACG ACTCACTATA GGGGAATTGT GAGCGGATAA CAATTCCCCT CTAGAAATAA 5040

TTTTGTTTAA CTTTAAGAAG GAGATATACA TATGGGCCAT CATCATCATC ATCACGTGAT 5100

CGACATCATC GGGACCAGCC CCACATCCTG GGAACAGGCG LjCUGCGGAGG CGGTCCAGCG 5160

GGCGCGGGAT AGCGTCGATG ACATCCGCGT CGCTCGGGTC ATTGAGCAGG ACATGGCCGT 5220

GGACAGCGCC GGCAAGATCA CCTACCGCAT CAAGCTCGAA GTGTCGTTCA AGATGAGGCC 5280

GGCGCAACCG AGGGGCTCGA AACCACCGAG CGGTTCGCCT GAAACGGGCG CCGGCGCCGG 5340

TACTGTCGCG ACTACCCCCG CGTCGTCGCC GGTGACGTTG GCGGAGACCG GTAGCACGCT 5400

GCTCTACCCG CTGTTCAACC TGTGGGGTCC GGCCTTTCAC GAGAGGTATC CGAACGTCAC 5460

GATCACCGCT CAGGGCACCG GTTCTGGTGC CGGGATCGCG CAGGCCGCCG CCGGGACGGT 5520

CAACATTGGG GCCTCCGACG CCTATCTGTC GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT 5580

GATGAACATC GCGCTAGCCA TCTCCGCTCA GCAGGTCAAC TACAACCTGC CCGGAGTGAG 5640

CGAGCACCTC AAGCTGAACG GAAAAGTCCT GGCGGCCATG TACCAGGGCA CCATCAAAAC 5700

CTGGGACGAC CCGCAGATCG CTGCGCTCAA CCCCGGCGTG AACCTGCCCG GCACCGCGGT 5760

AGTTCCGCTG CACCGCTCCG ACGGGTCCGG TGACACCTTC TTGTTCACCC AGTACCTGTC 5820

CAAGCAAGAT CCCGAGGGCT GGGGCAAGTC GCCCGGCTTC GGCACCACCG TCGACTTCCC 5880

GGCGGTGCCG GGTGCGCTGG GTGAGAACGG CAACGGCGGC ATGGTGACCG GTTGCGCCGA 5940

GACACCGGGC TGCGTGGCCT ATATCGGCAT CAGCTTCCTC GACCAGGCCA GTCAACGGGG 6000

ACTCGGCGAG GCCCAACTAG GCAATAGCTC TGGCAATTTC TTGTTGCCCG ACGCGCAAAG 6060

CATTCAGGCC GCGGCGGCTG GCTTCGCATC GAAAACCCCG GCGAACCAGG CGATTTCGAT 6120

GATCGACGGG CCCGCCCCGG ACGGCTACCC GATCATCAAC TACGAGTACG CCATCGTCAA 6180

CAACCGGCAA AAGGACGCCG CCACCGCGCA GACCTTGCAG GCATTTCTGC ACTGGGCGAT 6240

CACCGACGGC AACAAGGCCT CGTTCCTCGA CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC 6300

GGTGGTGAAG TTGTCTGACG CGTTGATCGC GACGATTTCC AGCGCTGAGA TGAAGACCGA 6360

TGCCGCTACC CTCGCGCAGG AGGCAGGTAA TTTCGAGCGG ATCTCCGGCG ACCTGAAAAC 6420

CCAGATCGAC CAGGTGGAGT CGACGGCAGG TTCGTTGCAG GGCCAGTGGC GCGGCGCGGC 6480

GGGGACGGCC GCCCAGGCCG CGGTGGTGCG CTTCCAAGAA GCAGCCAATA AGCAGAAGCA 6540

GGAACTCGAC GAGATCTCGA CGAATATTCG TCAGGCCGGC GTCCAATACT CGAGGGCCGA 6600

CGAGGAGCAG CAGCAGGCGC TGTCCTCGCA AATGGGCTTT GTGCCCACAA CGGCCGCCTC 6660 

GCCGCCGTCG ACCGCTGCAG CGCCACCCGC ACCGGCGACA CCTGTTGCCC CCCCACCACC 6720 

GGCCGCCGCC AACACGCCGA ATGCCCAGCC GGGCGATCCC AACGCAGCAC CTCCGCCGGC 6780 

CGACCCGAAC GCACCGCCGC CACCTGTCAT TGCCCCAAAC GCACCCCAAC CTGTCCGGAT 6840

CGACAACCCG GTTGGAGGAT TCAGCTTCGC GCTGCCTGCT GGCTGGGTGG AGTCTGACGC 6900 

GGGOGACTTC GACTACSG! , ■"' , n»o<?7. /-~r>r*r~* » , 

<a>„^A. — w^owwi-^*.;* v_rto<_«AAfteC ACCGGGGACC CGCCATTTCC 6560 

CGGACAGCCG CCGCCGGTGG CCAATGACAC CCGTATCGTG CTCGGCCGGC TAGACCAAAA 7020 

GCTTTACGCC AGCGCCGAAG CCACCGACTC CAAGGCCGCG GCCCGGTTGG GCTCGGACAT 7080 

GGGTGAGTTC TATATGCCCT ACCCGGGCAC CCGGATCAAC CAGGAAACCG TCTCGCTTGA 7140

CGCCAACGGG GTGTCTGGAA GCGCGTCGTA TTACGAAGTC AAGTTCAGCG ATCCGAGTAA 7200 

GCCGAACGGC CAGATCTGGA CGGGCGTAAT CGGCTCGCCC GCGGCGAACG CACCGGACGC 7260 

CGGGCCCCCT CAGCGCTGGT TTGTGGTATG GCTCGGGACC GCCAACAACC CGGTGGACAA 7320 

GGGCGCGGCC AAGGCGCTGG CCGAATCGAT CCGGCCTTTG GTCGCCCCGC CGCCGGCGCC 7380

GGCACCGGCT CCTGCAGAGC CCGCTCCGGC GCCGGCGCCG GCCGGGGAAG TCGCTCCTAC 7440 

CCCGACGACA CCGACACCGC AGCGGACCTT ACCGGCCTGA GAATTCTGCA GATATCCATC 7500 

ACACTGGCGG CCGCTCGAGC ACCACCACCA CCACCACTGA GATCCGGCTG CTAACAAAGC 7560 

CCGAAAGGAA GCTGAGTTGG CTGCTGCCAC CGCTGAGCAA TAACTAGCAT AACCCCTTGG 7620

GGCCTCTAAA CGGGTCTTGA GGGGTTTTTT GCTGAAAGGA GGAACTATAT CCGGAT 7676 

(2) INFORMATION FOR SEQ ID NO: 214:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

Met Gly His His His His His His Val Ile Asp Ile Ile Gly Thr Ser
1 5 10 15

Pro Thr Ser Trp Glu Gln Ala Ala Ala Glu Ala Val Gln Arg Ala Arg
20 25 30

Asp Ser Val Asp Asp Ile Arg Val Ala Arg Val Ile Glu Gln Asp Met
35 40 45

Ala Val Asp Ser Ala Gly Lys Ile Thr Tyr Arg Ile Lys Leu Glu Val
50 55 60

Ser Phe Lys Met Arg Pro Ala Gln Pro Arg Gly Ser Lys Pro Pro Ser
65 70 75 80

Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro
85 90 95

Ala Ser Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr
100 105 110

Pro Leu Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn
115 120 125

Val Thr Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln
130 135 140

Ala Ala Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser
145 150 155 160

Glu Gly Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala
165 170 175

Ile Ser Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His
180 185 190

Leu Lys Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile
195 200 205

Lys Thr Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn
210 215 220

Leu Pro Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly
225 230 235 240

Asp Thr Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly
245 250 255

Trp Gly Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val
260 265 270

Pro Gly Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys
275 280 285

Ala Glu Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp
290 295 300

Gln Ala Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser
305 310 315 320

Gly Asn Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala
325 330 335

Gly Phe Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp
340 345 350

Gly Pro Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile
355 360 365

Val Asn Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala
370 375 380

Phe Leu His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp
385 390 395 400

Gln Val His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp
405 410 415

Ala Leu Ile Ala Thr Ile Ser Ser Ala Glu Met Lys Thr Asp Ala Ala
420 425 430

Thr Leu Ala Gln Glu Ala Gly Asn Phe Glu Arg Ile Ser Gly Asp Leu
435 440 445

Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly Ser Leu Gln Gly
450 455 460

Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln Ala Ala Val Val Arg
465 470 475 480

Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu Asp Glu Ile Ser
485 490 495

Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu Glu
500 505 510

Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe Val Pro Thr Thr Ala
515 520 525

Ala Ser Pro Pro Ser Thr Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro
530 535 540

Val Ala Pro Pro Pro Pro Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro
545 550 555 560

Gly Asp Pro Asn Ala Ala Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro
565 570 575

Pro Pro Val Ile Ala Pro Asn Ala Pro Gln Pro Val Arg Ile Asp Asn
580 585 590

Pro Val Gly Gly Phe Ser Phe Ala Leu Pro Ala Gly Trp Val Glu Ser
595 600 605

Asp Ala Ala His Phe Asp Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr
610 615 620

Gly Asp Pro Pro Phe Pro Gly Gln Pro Pro Pro Val Ala Asn Asp Thr
625 630 635 640

Arg Ile Val Leu Gly Arg Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu
645 650 655

Ala Thr Asp Ser Lys Ala Ala Ala Arg Leu Gly Ser Asp Met Gly Glu
660 665 670

Phe Tyr Met Pro Tyr Pro Gly Thr Arg Ile Asn Gln Glu Thr Val Ser
675 680 685

Leu Asp Ala Asn Gly Val Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys
690 695 700

Phe Ser Asp Pro Ser Lys Pro Asn Gly Gln Ile Trp Thr Gly Val Ile
705 710 715 720

Gly Ser Pro Ala Ala Asn Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp
725 730 735

Phe Val Val Trp Leu Gly Thr Ala Asn Asn Pro Val Asp Lys Gly Ala
740 745 750

Ala Lys Ala Leu Ala Glu Ser Ile Arg Pro Leu Val Ala Pro Pro Pro
755 760 765

Ala Pro Ala Pro Ala Pro Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala
770 775 780

Gly Glu Val Ala Pro Thr Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu
785 790 795 800

Pro Ala 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215:

GTGGCGGCGC TGCGGCCGGC CAGCAGAGCG ATGTGCATCC GTTCGCGAAC CTGATCGCGG 60 

TCGACGATGA GCGCGCCGAA CGCCGCGACG ACGAAGAACG TCAGGAAGCC GTCCAGCAGC 120 

GCGGTCCGCG CGGTGACGAA GCTGACCCCG TCGCAGATCA GCAGCACCCC GGCGATGGCG 180 

J™?™ TCGACCGGCT GATCCGCCGC ACGATCCGCA CCACCAGCGC CACCAGGACC 240 

ACACCCAGCA GGGCGCCGGT GAACCGCCAG CCGAATCCGT TGTGACCGAA GATGGCCTCC 300

CCGATCGCGA TCAGCTGCTT ACCGACCGGC GGGTGAACCA CCAGGCCGTA CCCGGGGTTG 360
TCTTCCACCC CATGGTTGTT CAGCACCTGC CAGGCCTGGC GGTGCGTAAT GCTTCTCGTC 420 
GAAGATGGGG GTGCCGGCAT CCGTCACCGA GCCC 454 



(2) INFORMATION FOR SEQ ID NO:216: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 






TGCAGAAGTA CGGCGGATCC TCGGTGGCCG ACGCCGAACG GATTCGCCGC GTCGCCGAAC 60 

GCATCGTCGC CACCAAGAAG CAAGGCAATG ACGTCGTCGT CGTCGTCTCT GCCATGGGGG 120 

ATACCACCGA CGACCTGCTG GATCTGGCTC AGCAGGTGTG CCCGGCGCCG CCGCCTCGGG 180

AGCTGGACAT GCTGCTTACC GCCGGTGAAC GCATCTCGAA TGCGTTGGTG GCCATGGCCA 240

TCGAGTCGCT CGGCGCGCAT GCCCGGTCGT TCACCGGTTC GCAGGCCGGG GTGATCACCA 300

CCGGCACCCA CGGCAACGCC AAGATCATCG ACGTCACGCC GGGGCGGCTG CAAACCGCCC 360

TTGAGGAAGG GCGGGTCGTC TTGGTGGCCG GATTCCAAGG GGTCAGCCAG GACACCAAGG 420

ATGTCACGAC GTTGGGCCGC GGCGGCTCGG ACACCACCGC CGTCGCCATG 470

(2) INFORMATION FOR SEQ ID NO:217: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 279 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 



GGCCGGCGTA CCCGGCCGGG ACAAACAACG ATCGATTGAT ATCGATGAGA GACGGAGGAA 60 

TCGTGGCCCT TCCCCAGTTG ACCGACGAGC AGCGCGCGGC CGCGTTGGAG AAGGCTGCTG 120 

CCGCACGTCG AGCGCGAGCA GAGCTCAAGG ATCGGCTCAA GCGTGGCGGC ACCAACCTCA 180

CCCAGGTCCT CAAGGACGCG GAGAGCGATG AAGTCTTGGG CAAAATGAAG GTGTCTGCGC 240

TGCTTGAGGC CTTGCCAAAG GTGGGCAAGG TCCAGGCGC 279

(2) INFORMATION FOR SEQ ID NO: 218: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218:

ACACGGTCGA ACTCGACGAG CCCCTCGTGG AGGTGTCGAC CGACAAGGTC GACACCGAAA 60

TCCCTCGCCG GCCGCGGGTG TGCTGACCAA GATCATCGCC CAAGAAGATG ACACGGTCGA 120 

GGTCGGCGGC GAGCTCTCTG TCATTGGCGA CGCCCATGAT GCCGGCGAGG CCGCGGTCCC 180 

GGCACCCCAG AAAGTCTCTG CCGGCCCAAC CCGAATCCA 219 

(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 342 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

TCGCTGCCGA CATCGGCGCC GCGCCCGCCC CCAAGCCCGC ACCCAAGCCC GTCCCCGAGC 60

CAGCGCCGAC GCCGAAGGCC GAACCCGCAC CATCGCCGCC GGCGGCCCAG CCAGCCGGTG 120

CGGCCGAGGG CGCACCGTAC GTGACGCCGC TGGTGCGAAA GCTGGCGTCG GAAAACAACA 180

TCGACCTCGC CGGGGTGACC GGCACCGGAG TGGGTGGTCG CATCCGCAAA CAGGATGTGC 240

TGGCCGCGGC TGAACAAAAG AAGCGGGCGA AAGCACCGGC GCCGGCCGCC CAGGCCGCCG 300

CCGCGCCGGC CCCGAAAGCG CCGCCTGAAG ATCCGATGCC GC 342



(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 515 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

GGGTCTTGGT CAGTATCAGC GCCGACGAGG ACGCCACGGT GCCCGTCGGC GGCGAGTTGG 60 

CCCGGATCGG TGTCGCTGCC GACATCGGCG CCGCGCCCGC CCCCAAGCCC GCACCCAAGC 120

CCGTCCCCGA GCCAGCGCCG ACGCCGAAGG CCGAACCCGC ACCATCGCCG CCGGCGGCCC 180

AGCCAGCCGG TGCGGCCGAG GGCGCACCGT ACGTGACGCC GCTGGTGCGA AAGCTGGCGT 240

CGGAAAACAA CATCGACCTC GCCGGGGTGA CCGGCACCGG AGTGGGTGGT CGCATCCGCA 300

AACAGGATGT GCTGGCCGCG GCTGAACAAA AGAAGCGGGC GAAAGCACCG GCGCCCTGAG 360

CGCTTCATCA CCCGGTTAAC CAGCTTGCCC CAGAAGCCGG CTTCGACCTC TTCGCGGGTC 420

TTGGTCCGCT GCAGGCGGTC GGCGAGCCAG TTCAGGTTAG GCGGCCGAAA TCTTCCAGTT 480

CGCCAGGAAG GGCACCCGGA ACAGGGTCCG CACCC 515 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:221:



CCGACCCCAA GGTGCAGATT CAACAGGCCA TTGAGGAAGC ACAGCGCACC CACCAAGCGC 60

TGACTCAACA GGCGGCGCAA GTGATCGGTA ACCAGCGTCA ATTGGAGATG CGACTCAACC 120

GACAGCTGGC GGACATCGAA AAGCTTCAGG TCAATGTGCG CCAAGCCCTG ACGCTGGCCG 180

ACCAGGCCAC CGCCGCCGGA GACGCTGCCA AGGCCACCGA ATACAACAAC GCCGCCGAGG 240

CGTTCGCAGC CCAGCTGGTG ACCGCCGAGC AGAGCGTCGA AGACCTCAAG ACGCTGCATG 300

ACCAGGCGCT TAGCGCCGCA GCTCAGGCCA AGAAGGCCGT CGAACGAAAT GCGATGGTGC 360

TGCAGCAGAA GATCGCCGAG CGAACCAAGC TGCTCAGCCA GCTCGAGCAG GCGAAGATGC 420

AGGAGCAGGT CAGCGCATCG TTGCGGTCGA TGAGTGAGCT CGCCGCGCCA GGCAACACGC 480

CGAGCCTCGA CGAGGTGCGC GACAAGATCG AGCGTCGCTA CGCCAACGCG ATCGGTTCGG 540 

CTGAACTTGC CGAGAGT 557 



(2) INFORMATION FOR SEQ ID NO: 222: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 



CAGGATAGGT TTCGACATCC ACCTGGGTTC CGCACCCGGT GCGCGACCGT GTGATAGGCC 60

AGAGGTGGAC CTGCGCCGAC CGACGATCGA TCGAGGAGTC AACAGAAATG GCCTTCTCCG 120

TCCAGATGCC GGCACTCGGT GAGAGCGTCA CCGAGGGGAC GGTTACCCGC TGGCTCAAAC 180

AGGAAGGCGA CACGGTCGAA CTCGACGAGC CCCTCGTGGA GGT 223



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 



AAGAAGTACA TCTGCCGGTC GATGTCGGCG AACCACGGCA GCCAACCGGC GCAGTAGCCG 60 

ACCAGGACCA CCGCATAACG CCAGTCCCGG CGCACAAACA TACGCCACCC CGCGTATGCC 120 

AGGACTGGCA CCGCCAGCCA CCACATCGCG GGCGTGCCGA CCAGCATCTC GGCCTTGACG 180 

CACGACTGTG CGCCGCAGCC TGCAACGTCT TGCTGGTCGA TGGCGTACAG CACCGGCCGC 240

AACGACATGG GCCAGGTCCA CGGTTTGGAT TCCCAAGGGT GGTAGTTGCC TGCGGAATTC 300

GTCAGGCCCG CGTGGAAGTG GAACGCTTTG GCGGTGTATT GCCAGAGCGA GCGCACGGCG 360 

TCGGGCAGCG GAACAACCGA GTTGCGACCG ACCGCTTGAC CGACCGCATG CCGATCGATC 420 

GCGGTCTCGG ACGCGAACCA CGGAGCGTAG GTGGCCAGAT AGACCGCGAA CGGGATCAAC 480 

CCCAGCGCAT ACCCGCTGGG AAGCACGTCA CGCCGCACTG TTCCCAGCCA CGGTCTTTGC 540 

ACTTGGTATG AACGTCGCGC CGCCACGTCA ACGCCAGC 578 



(2) INFORMATION FOR SEQ ID NO: 224: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

ACAACGATCG ATTGATATCG ATGAGAGACG GAGGAATCGT GGCCCTTCCC CAGTTGACCG 60 

ACGAGCAGCG CGCGGCCGCG TTGGAGAAGG CTGCTGCCGC ACGTCGAGCG CGAGCAGAGC 120 

TCAAGGATCG GCTCAAGCGT GGCGGCACCA ACCTCACCCA GGTCCTCAAG GACGCGGAGA 180

GCGATGAAGT CTTGGGCAAA ATGAAGGTGT CTGCGCTGCT TGAGGCCTTG CCAAAGGTGG 240

GCAAGGTCAA GSGGGAGGAG ATCATGACCG AGCTGGAAAT TGCGCCCCAC CCCGCCGCCT 300

TCGTGGCCTC GGTGACCGTC AGCGCAAGGC CCTGCTGGAA AAGTTCGGCT CCGCCTAACC 360

CCGCCGGCCG ACGATGCGGG CCGGAAGGCC TGTGGTGGGC GTACCCCCGC ATACGGGGGA 420

GAAGCGGCCT GACAGGGCCA GCTCACAATT CAGGCCGAAC GCCCCGGTGG GGGGGAACCC 480

GCCC 484

(2) INFORMATION FOR SEQ ID NO: 225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 537 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225:

AGGACTGGCA CCGCCAGCCA CCACATCGCG GGCGTGCCGA CCAGCATCTC GGCCTTGACG 60 

CACGACTGTG CGCCGCAGCC TGCAACGTCT TGCTGGTCGA TGGCGTACAG CACCGGCCGC 120 

AACGACATGG GCCAGGTCCA CGGTTTGGAT TCCCAAGGGT GGTAGTTGCC TGCGGAATTC 180 

GTCAGGCCCG CGTGGAAGTG GAACGCTTTG GCGGTGTAGT GCCAGAGCGA GCGCACGGCG 240 

TCGGGCAGCG GAACAACCGA GTTGCGACCG ACCGCTTGAC CGACCGCATG CCGATCGATC 300 

GCGGTCTCGG ACGCGAACCA CGGAGCGTAG GTGGCCAGAT AGACCGCGAA CGGGATCAAC 360 

CCCAGCGCAT ACCCGCTGGG AAGCACGTCA CGCCGCACTG TCCCCAGCCA CGGTCTTTGC 420

ACTTGGTACT GACGTCGCGC CGCCACGTCG AACGCCAGCG CCATCGCGCC GAAGAACAGC 480

ACGAAGTACA CGCCGGACCA CTTGGTGGCG CAAGCCAATC CCAAGCAGCA CCCCGGC 537 

(2) INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 



Gly Gly Ala Ala Ala Gly Gln Gln Ser Asp Val His Pro Phe Ala Asn
1 5 10 15

Leu Ile Ala Val Asp Asp Glu Arg Ala Glu Arg Arg Asp Asp Glu Glu
20 25 30

Arg Gln Glu Ala Val Gln Gln Arg Gly Pro Arg Gly Asp Glu Ala Asp
35 40 45

Pro Val Ala Asp Gln Gln His Pro Gly Asp Gly Ala Asp Gln Cys Arg
50 55 60

Pro Ala Asp Pro Pro His Asp Pro His His Gln Arg His Gln Asp His
65 70 75 80

Thr Gln Gln Gly Ala Gly Glu Pro Pro Ala Glu Ser Val Val Thr Glu
85 90 95

Asp Gly Leu Pro Asp Arg Asp Gln Leu Leu Thr Asp Arg Arg Val Asn
100 105 110

His Gln Ala Val Pro Gly Val Val Phe His Pro Met Val Val Gln His
115 120 125

Leu Pro Gly Leu Ala Val Arg
130 135



(2) INFORMATION FOR SEQ ID NO: 227: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

Gln Lys Tyr Gly Gly Ser Ser Val Ala Asp Ala Glu Arg Ile Arg Arg
1 5 10 15

Val Ala Glu Arg Ile Val Ala Thr Lys Lys Gln Gly Asn Asp Val Val
20 25 30

Val Val Val Ser Ala Met Gly Asp Thr Thr Asp Asp Leu Leu Asp Leu
35 40 45

Ala Gln Gln Val Cys Pro Ala Pro Pro Pro Arg Glu Leu Asp Met Leu
50 55 60

Leu Thr Ala Gly Glu Arg Ile Ser Asn Ala Leu Val Ala Met Ala Ile
65 70 75 80

Glu Ser Leu Gly Ala His Ala Arg Ser Phe Thr Gly Ser Gln Ala Gly
85 90 95

Val Ile Thr Thr Gly Thr His Gly Asn Ala Lys Ile Ile Asp Val Thr
100 105 110

Pro Gly Arg Leu Gln Thr Ala Leu Glu Glu Gly Arg Val Val Leu Val
115 120 125

Ala Gly Phe Gln Gly Val Ser Gln Asp Thr Lys Asp Val Thr Thr Leu
130 135 140

Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala Met
145 150 155



(2) INFORMATION FOR SEQ ID NO: 228:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 92 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 

Pro Ala Tyr Pro Ala Gly Thr Asn Asn Asp Arg Leu Ile Ser Met Arg
1 5 10 15

Asp Gly Gly Ile Val Ala Leu Pro Gln Leu Thr Asp Glu Gln Arg Ala
20 25 30

Ala Ala Leu Glu Lys Ala Ala Ala Ala Arg Arg Ala Arg Ala Glu Leu
35 40 45

Lys Asp Arg Leu Lys Arg Gly Gly Thr Asn Leu Thr Gln Val Leu Lys
50 55 60

Asp Ala Glu Ser Asp Glu Val Leu Gly Lys Met Lys Val Ser Ala Leu
65 70 75 80

Leu Glu Ala Leu Pro Lys Val Gly Lys Val Gln Ala
85 90



(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

Thr Val Glu Leu Asp Glu Pro Leu Val Glu Val Ser Thr Asp Lys Val
1 5 10 15

Asp Thr Glu Ile Pro Ser Pro Ala Ala Gly Val Leu Thr Lys Ile Ile
20 25 30

Ala Gln Glu Asp Asp Thr Val Glu Val Gly Gly Glu Leu Ser Val Ile
35 40 45

Gly Asp Ala His Asp Ala Gly Glu Ala Ala Val Pro Ala Pro Gln Lys
50 55 60

Val Ser Ala Gly Pro Thr Arg Ile
65 70



(2) INFORMATION FOR SEQ ID NO: 230: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

Ala Ala Asp Ile Gly Ala Ala Pro Ala Pro Lys Pro Ala Pro Lys Pro
1 5 10 15

Val Pro Glu Pro Ala Pro Thr Pro Lys Ala Glu Pro Ala Pro Ser Pro
20 25 30

Pro Ala Ala Gln Pro Ala Gly Ala Ala Glu Gly Ala Pro Tyr Val Thr
35 40 45

Pro Leu Val Arg Lys Leu Ala Ser Glu Asn Asn Ile Asp Leu Ala Gly
50 55 60

Val Thr Gly Thr Gly Val Gly Gly Arg Ile Arg Lys Gln Asp Val Leu
65 70 75 80

Ala Ala Ala Glu Gln Lys Lys Arg Ala Lys Ala Pro Ala Pro Ala Ala
85 90 95

Gln Ala Ala Ala Ala Pro Ala Pro Lys Ala Pro Pro Glu Asp Pro Met
100 105 110



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:231: 



Val Leu Val Ser Ile Ser Ala Asp Glu Asp Ala Thr Val Pro Val Gly
1 5 10 15

Gly Glu Leu Ala Arg Ile Gly Val Ala Ala Asp Ile Gly Ala Ala Pro
20 25 30

Ala Pro Lys Pro Ala Pro Lys Pro Val Pro Glu Pro Ala Pro Thr Pro
35 40 45

Lys Ala Glu Pro Ala Pro Ser Pro Pro Ala Ala Gln Pro Ala Gly Ala
50 55 60

Ala Glu Gly Ala Pro Tyr Val Thr Pro Leu Val Arg Lys Leu Ala Ser
65 70 75 80

Glu Asn Asn Ile Asp Leu Ala Gly Val Thr Gly Thr Gly Val Gly Gly
85 90 95

Arg Ile Arg Lys Gln Asp Val Leu Ala Ala Ala Glu Gln Lys Lys Arg
100 105 110

Ala Lys Ala Pro Ala Pro
115



(2) INFORMATION FOR SEQ ID NO:232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:232:



Asp Pro Lys Val Gln Ile Gln Gln Ala Ile Glu Glu Ala Gln Arg Thr
1 5 10 15

His Gln Ala Leu Thr Gln Gln Ala Ala Gln Val Ile Gly Asn Gln Arg
20 25 30

Gln Leu Glu Met Arg Leu Asn Arg Gln Leu Ala Asp Ile Glu Lys Leu
35 40 45

Gln Val Asn Val Arg Gln Ala Leu Thr Leu Ala Asp Gln Ala Thr Ala
50 55 60

Ala Gly Asp Ala Ala Lys Ala Thr Glu Tyr Asn Asn Ala Ala Glu Ala
65 70 75 80

Phe Ala Ala Gln Leu Val Thr Ala Glu Gln Ser Val Glu Asp Leu Lys
85 90 95

Thr Leu His Asp Gln Ala Leu Ser Ala Ala Ala Gln Ala Lys Lys Ala
100 105 110

Val Glu Arg Asn Ala Met Val Leu Gln Gln Lys Ile Ala Glu Arg Thr
115 120 125

Lys Leu Leu Ser Gln Leu Glu Gln Ala Lys Met Gln Glu Gln Val Ser
130 135 140

Ala Ser Leu Arg Ser Met Ser Glu Leu Ala Ala Pro Gly Asn Thr Pro
145 150 155 160

Ser Leu Asp Glu Val Arg Asp Lys Ile Glu Arg Arg Tyr Ala Asn Ala
165 170 175

Ile Gly Ser Ala Glu Leu Ala Glu Ser
180 185















(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

Val Ser Thr Ser Thr Trp Val Pro His Pro Val Arg Asp Arg Val Ile
1 5 10 15

Gly Gln Arg Trp Thr Cys Ala Asp Arg Arg Ser Ile Glu Glu Ser Thr
20 25 30

Glu Met Ala Phe Ser Val Gln Met Pro Ala Leu Gly Glu Ser Val Thr
35 40 45

Glu Gly Thr Val Thr Arg Trp Leu Lys Gln Glu Gly Asp Thr Val Glu
50 55 60

Leu Asp Glu Pro Leu Val Glu
65 70



(2) INFORMATION FOR SEQ ID NO: 234:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 





Glu Val His Leu Pro Val Asp Val Gly Glu Pro Arg Gln Pro Thr Gly
1 5 10 15

Ala Val Ala Asp Gln Asp His Arg Ile Thr Pro Val Pro Ala His Lys
20 25 30

His Thr Pro Pro Arg Val Cys Gln Asp Trp His Arg Gln Pro Pro His
35 40 45

Arg Gly Arg Ala Asp Gln His Leu Gly Leu Asp Ala Arg Leu Cys Ala
50 55 60

Ala Ala Cys Asn Val Leu Leu Val Asp Gly Val Gln His Arg Pro Gln
65 70 75 80

Arg His Gly Pro Gly Pro Arg Phe Gly Phe Pro Arg Val Val Val Ala
85 90 95

Cys Gly Ile Arg Gln Ala Arg Val Glu Val Glu Arg Phe Gly Gly Val
100 105 110

Leu Pro Glu Arg Ala His Gly Val Gly Gln Arg Asn Asn Arg Val Ala
115 120 125

Thr Asp Arg Leu Thr Asp Arg Met Pro Ile Asp Arg Gly Leu Gly Arg
130 135 140

Glu Pro Arg Ser Val Gly Gly Gln Ile Asp Arg Glu Arg Asp Gln Pro
145 150 155 160

Gln Arg Ile Pro Ala Gly Lys His Val Thr Pro His Cys Ser Gln Pro
165 170 175

Arg Ser Leu His Leu Val
180

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 160 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Asn Asp Arg Leu Ile Ser Met Arg Asp Gly Gly Ile Val Ala Leu Pro
1 5 10 15

Gln Leu Thr Asp Glu Gln Arg Ala Ala Ala Leu Glu Lys Ala Ala Ala
20 25 30

Ala Arg Arg Ala Arg Ala Glu Leu Lys Asp Arg Leu Lys Arg Gly Gly
35 40 45

Thr Asn Leu Thr Gln Val Leu Lys Asp Ala Glu Ser Asp Glu Val Leu
50 55 60

Gly Lys Met Lys Val Ser Ala Leu Leu Glu Ala Leu Pro Lys Val Gly
65 70 75 80

Lys Val Lys Ala Gln Glu Ile Met Thr Glu Leu Glu Ile Ala Pro His
85 90 95

Pro Ala Ala Phe Val Ala Ser Val Thr Val Ser Ala Arg Pro Cys Trp
100 105 110

Lys Ser Ser Ala Pro Pro Asn Pro Ala Gly Arg Arg Cys Gly Pro Glu
115 120 125

Gly Leu Trp Trp Ala Tyr Pro Arg Ile Arg Gly Arg Ser Gly Leu Thr
130 135 140

Gly Pro Ala His Asn Ser Gly Arg Thr Pro Arg Trp Gly Gly Thr Arg
145 150 155 160



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:236: 



Asp Trp His Arg Gln Pro Pro His Arg Gly Arg Ala Asp Gln His Leu
1 5 10 15

Gly Leu Asp Ala Arg Leu Cys Ala Ala Ala Cys Asn Val Leu Leu Val
20 25 30

Asp Gly Val Gln His Arg Pro Gln Arg His Gly Pro Gly Pro Arg Phe
35 40 45

Gly Phe Pro Arg Val Val Val Ala Cys Gly Ile Arg Gln Ala Arg Val
50 55 60

Glu Val Glu Arg Phe Gly Gly Val Val Pro Glu Arg Ala His Gly Val
65 70 75 80

Gly Gln Arg Asn Asn Arg Val Ala Thr Asp Arg Leu Thr Asp Arg Met
85 90 95

Pro Ile Asp Arg Gly Leu Gly Arg Glu Pro Arg Ser Val Gly Gly Gln
100 105 110

Ile Asp Arg Glu Arg Asp Gln Pro Gln Arg Ile Pro Ala Gly Lys His
115 120 125

Val Thr Pro His Cys Pro Gln Pro Arg Ser Leu His Leu Val Leu Thr
130 135 140

Ser Arg Arg His Val Glu Arg Gln Arg His Arg Ala Glu Glu Gln His
145 150 155 160

Glu Val His Ala Gly Pro Leu Gly Gly Ala Ser Gln Ser Gln Ala Ala
165 170 175

Pro Arg 



(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:237: 

ATGCCAAGCC GGTGCTGATG CCCGAGCTCG GCGAATCGGT GACCGAGGGG ACCGTCATTC 60 

GTTGGCTGAA GAAGATCGGG GATTCGGTTC AGGTTGACGA GCCACTCGTG GAGGTGTCCA 120 

CCGACAAGGT GGACACCGAG ATCCCGTCCC CGGTGGCTGG GGTCTTGGTC AGTATCAGCG 180

CCGACGAGGA CGCCACGGTG CCCGTCGGCG GCGAGTTGGC CCGGATCGGT GTCGCTGCCG 240

AGATCGGCGC CGCGCCCGCC CCCAAGCCCC C 271 

(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 89 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 





(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:238:

Ala Lys Pro Val Leu Met Pro Glu Leu Gly Glu Ser Val Thr Glu Gly
1 5 10 15

Thr Val Ile Arg Trp Leu Lys Lys Ile Gly Asp Ser Val Gln Val Asp
20 25 30

Glu Pro Leu Val Glu Val Ser Thr Asp Lys Val Asp Thr Glu Ile Pro
35 40 45

Ser Pro Val Ala Gly Val Leu Val Ser Ile Ser Ala Asp Glu Asp Ala
50 55 60

Thr Val Pro Val Gly Gly Glu Leu Ala Arg Ile Gly Val Ala Ala Glu
65 70 75 80

Ile Gly Ala Ala Pro Ala Pro Lys Pro
85

























(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 

GAGGTAGCGG ATGGCCGGAG GAGCACCCCA GGACCGCGCC CGAACCGCGG GTGCCGGTCA 60 
TCGATATGTG GGCACCGTTC GTTCCGTCCG CCGAGGTCAT TGACGAT 107 



(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

ATGAAGTTGA AGTTTGCTCG CCTGAGTACT GCGATACTGG GTTGTGCAGC GGCGCTTGTG 60 

TTTCCTGCCT CGGTTGCCAG CGCAGATCCA CCTGACCCGC ATCAGCCGGA CATGACGAAA 120 

GGCTATTGCC CGGGTGGCCG ATGGGGTTTT GGCGACTTGG CCGTGTGCGA CGGCGAGAAG 180 

TACCCCGACG GCTCGTTTTG GCACCAGTGG ATGCAAACGT GGTTTACCGG CCCACAGTTT 240 

TACTTCGATT GTGTCAGCGG CGGTGAGCCC CTCCCCGGCC CGCCGCCACC GGGTGGTTGC 300 

GGTGGGGCAA TTCCGTCCGA GCAGCCCAAC GCTCCCTGA 339



(2) INFORMATION FOR SEQ ID NO:241:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 112 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241:



Met Lys Leu Lys Phe Ala Arg Leu Ser Thr Ala Ile Leu Gly Cys Ala
1 5 10 15

Ala Ala Leu Val Phe Pro Ala Ser Val Ala Ser Ala Asp Pro Pro Asp
20 25 30

Pro His Gln Pro Asp Met Thr Lys Gly Tyr Cys Pro Gly Gly Arg Trp
35 40 45

Gly Phe Gly Asp Leu Ala Val Cys Asp Gly Glu Lys Tyr Pro Asp Gly
50 55 60

Ser Phe Trp His Gln Trp Met Gln Thr Trp Phe Thr Gly Pro Gln Phe
65 70 75 80

Tyr Phe Asp Cys Val Ser Gly Gly Glu Pro Leu Pro Gly Pro Pro Pro
85 90 95

Pro Gly Gly Cys Gly Gly Ala Ile Pro Ser Glu Gln Pro Asn Ala Pro
100 105 110





(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242:

GTGACCACGG TGGGCCTGCC ACCAACCCGG GCAGCGGCAG CCGCGGCGGC GCCGGCGGCT 60

CCGGCGGCAA CGGTGGCGCC GGGGGTAACG CCACCGGCTC AGGCGGCAAG GGCGGCGCCG 120 

GTGGCAATGG CGGTGATGGG AGCTTCGGCG CTACCAGCGG CCCCGCCTCC ATCGGGGTCA 180 

CGGGCGCCCC CGGCGGCAAC GGCGGCAAGG GCGGCGCCGG TGGCAGCAAC CCCAACGGCT 240

CAGGTGGCGA CGGCGGCAAA GGCGGCAACG GCGGTGCCGG CGGCAACGGG GGCTCGATCG 300

GCGCCAACAG CGGCATCGTC GGCGGTTCCG GTGGGGCCGG TGGCGCTGGC GGCGCCGGCG 360

GAAACGGCAG C 371



(2) INFORMATION FOR SEQ ID NO: 243:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 424 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243:

GTCCGGGTCC CACCACCGCG CCGGCGCGCC CCTAGCGGCC GGGCGCACCA GCCCCTTTTC 60

TTGACTCGTT CAAGAAAAGG GCCTTCTGTT TGGTCGGCCA TGTTGGCATG ATCGTGACCC 120

ATGGGCAACA TCGACGTCGA CATCTCGGCC AAGGTCTAGC TCCATGCGAA TCGCCGCCGC 180

GGTGGTGAGC ATCGGTCTAG CCGTCATAGC AGGGTTCGCG GTACCTGTTG CCGACGCACA 240

CCCGTCGGAG CCCGGGGTTG TGTCCTACGC GGTGCTCGGA AAGGGGTCGG TCGGCAACAT 300

CGTCGGCGCC CCAATGGGGT GGGAGGCGGT GTTCACCAAG CCGTTCCAGG CGTTTTGGGT 360

CGAACTACCG GCGTGCAACA ACTGGGTGGA CATCGGGCTG CCCGAGGTGT ACGACGATCC 420

CGAC 424

(2) INFORMATION FOR SEQ ID NO: 244:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244:

GCGATGGCGG CCGCGGGTAC CACCGCCAAT GTGGAACGGT TTCCCAACCC CAACGATCCT 60

TTGCATCTGG CGTCAATTGA CTTCAGCCCG GCCGATTTCG TCACCGAGGG CCACCGTCTA 120

AGGGCGGATG CGATCCTACT GCGCCGTACC GACCGGCTGC CTTTCGCCGA GCCGCCGGAT 180

TGGGACTTGG TGGAGTCGCA GTTGCGCACG ACCGTCACCG CCGACACGGT GCGCATCGAC 240

GTCATCGCCG ACGATATGCG TCCCGAACTG GCGGCGGCGT CCAAACTCAC CGAATCGCTG 300 

CGGCTCTACG ATTCGTC 317 

(2) INFORMATION FOR SEQ ID NO:245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 



TGGCGTATGC GCTTCGCAGC CGGTGCCGCG TCAACGCGCC GGAGGCAATC GCTTCGCTGC
CGAGGAATGG TTCGATCACG ATCGCAGTGT GCCGTCGTGC ACCGACACCG CCGTCCAACG
TGAACTGAGG GCGGAAAATC GGCCGAAATC TCGCCCTCAG TTCACGCTCG GCGCCTAACG
GTTCTGGAAG TTGGGTGCGC GCTTCTCGGC GAACGCGCGC GGGCCTTCCT TGGCGTCGTC
GGACAGGAAG ACCTTGATGC CGATCTGGGT GTCGATCTTG AACGCCTCGT TTTCGGGCAT
GCACTCGGTC TCGCGGATGG ACCGCAAGAT GGCCTGCACG GCCAGGGGTC CGTTAGCCGA
GATGGCGTCG GCAAGTTCTA GAACCTTGGT CAACGCCTGG CCGTCGGGCA CACGTGGCCG
AT



(2) INFORMATION FOR SEQ ID NO: 246:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 



GCGTGCCGCT GAACACCAGC CCGCGGCTGC CAGATCTCCC GGACTCGGTA GTGCCGCCGG 
TGGCGTCGTT GCTCTCCTGA CGGGGCGCGG CGACCATAAG GTCGCTAATG CCCAGGTAGC 
GGCCCAGGTG CATGGAGTCG ATGATGATGC GACTCTCCAG CTCGCCGACC GGGAGCTTGG 
CATCGGGCCT GATCAGCCAG GACGCGTAGG ACAAGTCGAT CGAATGCATA GTGGCCTCCA 
GAGTGGCCGT GCCACTTCCG GCGTGCTCCA CGGCAAATGC CTTGATTTCT AGCTCCGCGT 
AGTGTTCCCG CATCGCCTGC GGGATGAATG GGAACCGCAG GATGGCGACA AACGGGTCTG 
ACCTCAGGTT TGCCGCTTTG CGCACAGTGG TCGACAGCCG GTACTCGGCA TAAATGCTGG 
CCCCGA 

(2) INFORMATION FOR SEQ ID NO: 247: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:247: 



AGACCGGCGA GGGTGTGGTC GCTGCCCGCG GCATTGTCGA TAATCTGCGC TGGGTCGACG
CGCCGATCAA CTAGTGAGGC GCAACGCTAG GCTTTGGGAT ACCCACAGCT AAAAAGTTTA
TCAAAGAAAC GAAGAAGGTT GCCATGAGCA CTGTTGCCGC CTACGCCGCC ATGTCGGCGA
CCGAACCCCT GACCAAGACC ACGATCACCC GTCGCGACCC GGGCCCGCAC GACATGGCGA
TCGACATCAA ATTCGCCGGA ATCTGTCGCT CGGACATCCA TACCGTCCAA ACCGAATGGG
GGCAACCGAA TTTACCTGTG GTCCCTG

(2) INFORMATION FOR SEQ ID NO:248:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 123 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 



Asp His Gly Gly Pro Ala Thr Asn Pro Gly Ser Gly Ser Arg Gly Gly
1               5                   10                  15

Ala Gly Gly Ser Gly Gly Asn Gly Gly Ala Gly Gly Asn Ala Thr Gly
            20                  25                  30

Ser Gly Gly Lys Gly Gly Ala Gly Gly Asn Gly Gly Asp Gly Ser Phe
        35                  40                  45

Gly Ala Thr Ser Gly Pro Ala Ser Ile Gly Val Thr Gly Ala Pro Gly
    50                  55                  60

Gly Asn Gly Gly Lys Gly Gly Ala Gly Gly Ser Asn Pro Asn Gly Ser
65                  70                  75                  80

Gly Gly Asp Gly Gly Lys Gly Gly Asn Gly Gly Ala Gly Gly Asn Gly
                85                  90                  95

Gly Ser Ile Gly Ala Asn Ser Gly Ile Val Gly Gly Ser Gly Gly Ala
            100                 105                 110

Gly Gly Ala Gly Gly Ala Gly Gly Asn Gly Ser
        115                 120



(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 104 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:249: 

Met Ala Ala Ala Gly Thr Thr Ala Asn Val Glu Arg Phe Pro Asn Pro
1               5                   10                  15

Asn Asp Pro Leu His Leu Ala Ser Ile Asp Phe Ser Pro Ala Asp Phe
            20                  25                  30

Val Thr Glu Gly His Arg Leu Arg Ala Asp Ala Ile Leu Leu Arg Arg
        35                  40                  45

Thr Asp Arg Leu Pro Phe Ala Glu Pro Pro Asp Trp Asp Leu Val Glu
    50                  55                  60

Ser Gln Leu Arg Thr Thr Val Thr Ala Asp Thr Val Arg Ile Asp Val
65                  70                  75                  80

Ile Ala Asp Asp Met Arg Pro Glu Leu Ala Ala Ala Ser Lys Leu Thr
                85                  90                  95

Glu Ser Leu Arg Leu Tyr Asp Ser
            100

(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:250: 

Ala Tyr Ala Leu Arg Ser Arg Cys Arg Val Asn Ala Pro Glu Ala Ile
1               5                   10                  15

Ala Ser Leu Pro Arg Asn Gly Ser Ile Thr Ile Ala Val Cys Arg Arg
            20                  25                  30

Ala Pro Thr Pro Pro Ser Asn Val Asn
        35                  40

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

Val Pro Leu Asn Thr Ser Pro Arg Leu Pro Asp Leu Pro Asp Ser Val
1               5                   10                  15

Val Pro Pro Val Ala Ser Leu Leu Ser
            20                  25

(2) INFORMATION FOR SEQ ID NO: 252:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 

Met Ser Thr Val Ala Ala Tyr Ala Ala Met Ser Ala Thr Glu Pro Leu
1               5                   10                  15

Thr Lys Thr Thr Ile Thr Arg Arg Asp Pro Gly Pro His Asp Met Ala
            20                  25                  30

Ile Asp Ile Lys Phe Ala Gly Ile Cys Arg Ser Asp Ile His Thr Val
        35                  40                  45

Gln Thr Glu Trp Gly Gln Pro Asn Leu Pro Val Val Pro
    50                  55                  60

(2) INFORMATION FOR SEQ ID NO: 253:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 

GCTTGGAGCC CTGGAGCGAC GGTGTGGGTC TGGGGGTCGA TTCGTTCTCG GCGAAAGTCA 

ACTAAAGACC ACGTTGACAC CCAACCGGCG GCCCGGCATG GGCCGTCGCG GCGTAGAAGC

TTTGACCGCG GCGCGAAACG TTCGCTGCTG CGGCCCATGC AGATCGCACA CGCTTGCTTG 
AACATCGGGT GGAGCCGGTG GTAACGCCAG GCT 

(2) INFORMATION FOR SEQ ID NO: 254:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

CCGAGCTGCT GTTCGGCGCC GGCGGTGCGG GCGGCGCGGG TGGGGCGGGC ACCGACGGCG 
GGCCCGGTGC TACCGGCGGG ACCGGCGGAC ACGGCGGAGT CGGCGGCGAC GGCGGATGGC
TGGCACCCGG CGGGGCCGGC GGGGCCGGCG GGCAAGGCGG GGCAGGTGGT GCCCGCAGCG 
ATGGTGGCGC GTTGGGTGGT ACCGGCGGGA CGGGCGGTAC CGGCGGCGCC GGTGGCGCCG 
GCGGTCGCGG CACACTGCTG CTGGGCGCTG GCGGACAGGG CGGCCTCGGC GGCGCCGGCG 
GACAAGGCGG CACCGGCGGG GGCCGGCGGA GATGGCGTTC TGGGGGGTGT CAGTGGCACT 
GGTGGTA 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:

AAGGCGTGAT TGGCAAGGCG ACCGCGCAGC GGCCCGTAGC CGCGGGACGG CCCAGGCCCC 
GACCGCAGCG GCCGGTGTCT GACCGGGTCA GCGACCAGCG GCGCTGACCG TGCCGCTCGT 
CTACTTCGAC GCCAGCGCCT TCGTCAAACT TCTCACCACC GAGACAGGGA GCTCGCTGGC
GTCCGCTCTA TGGGACGGCT GCGACGCCGC ATTGTCCAAC CGCCTGGCCT ACCCCGAAGT
CCGCGCCGCA CTCGCTGCAA CGGGCCGCAA TCACGACCTA ACCGAATCCG AGCTCGCCGA
CGCCGAGCGT GACTGGGAGG ACTTCTGGGC CGCACCCGCC CAGTCGAACT CACCGCGACG
GTTGAACAGC ACGCCGGGCA CCTCGCCCGA ACACATGCCT TACGCGGAGC CGACACCGTT

(2) INFORMATION FOR SEQ ID NO: 256:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 



CTCTTGTCGG TGGCATCGGC GGTACCGGCG GAACCGGCGG CAACGCCGGT ATGCTCGCCG
GCGCCGCCGG GGCCGGCGGT GCCGGCGGGT TCAGCTTCAG CACTGCCGGT GGGGCTGGCG
GCGCCGGCGG GGCCGGTGGG CTGTTCACCA CCGGCGGTGT CGGCGGCGCC GGTGGGCAGG
GTCACACGGG CGGGGCGGGC GGCGCCGGCG GGGCCGGCGG GTTGTTTGGT GCCGGCGGCA
TGGGCGGGGC GGGCGGATTC GGGGATCACG GAACGCTCGG CACCGGCGGG GCCGGCGGG



(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 

Leu Glu Pro Trp Ser Asp Gly Val Gly Leu Gly Val Asp Ser Phe Ser
1               5                   10                  15

Ala Lys Val Asn
            20

(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

Glu Leu Leu Phe Gly Ala Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly
1               5                   10                  15

Thr Asp Gly Gly Pro Gly Ala Thr Gly Gly Thr Gly Gly His Gly Gly
            20                  25                  30

Val Gly Gly Asp Gly Gly Trp Leu Ala Pro Gly Gly Ala Gly Gly Ala
        35                  40                  45

Gly Gly Gln Gly Gly Ala Gly Gly Ala Arg Ser Asp Gly Gly Ala Leu
    50                  55                  60

Gly Gly Thr Gly Gly Thr Gly Gly Thr Gly Gly Ala Gly Gly Ala Gly
65                  70                  75                  80

Gly Arg Gly Thr Leu Leu Leu Gly Ala Gly Gly Gln Gly Gly Leu Gly
                85                  90                  95

Gly Ala Gly Gly Gln Gly Gly Thr Gly Gly Gly Arg Arg Arg Trp Arg
            100                 105                 110

Ser Gly Gly Cys Gln Trp His Trp Trp
        115                 120

(2) INFORMATION FOR SEQ ID NO: 259: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Gly Val Ile Gly Lys Ala Thr Ala Gln Arg Pro Val Ala Ala Gly Arg
1               5                   10                  15

Pro Arg Pro Arg Pro Gln Arg Pro Val Ser Asp Arg Val Ser Asp Gln
            20                  25                  30

Arg Arg



(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:260: 



Leu Val Gly Gly Ile Gly Gly Thr Gly Gly Thr Gly Gly Asn Ala Gly
1               5                   10                  15

Met Leu Ala Gly Ala Ala Gly Ala Gly Gly Ala Gly Gly Phe Ser Phe
            20                  25                  30

Ser Thr Ala Gly Gly Ala Gly Gly Ala Gly Gly Ala Gly Gly Leu Phe
        35                  40                  45

Thr Thr Gly Gly Val Gly Gly Ala Gly Gly Gln Gly His Thr Gly Gly
    50                  55                  60

Ala Gly Gly Ala Gly Gly Ala Gly Gly Leu Phe Gly Ala Gly Gly Met
65                  70                  75                  80

Gly Gly Ala Gly Gly Phe Gly Asp His Gly Thr Leu Gly Thr Gly Gly
                85                  90                  95

Ala Gly Gly



(2) INFORMATION FOR SEQ ID NO: 261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 282 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261: 

TCCTGTTCGG CGCCGGCGGG GTGGGCGGTG TTGGCGGTGA CGGTGTGGCA TTCCTGGGCA 60 

CCGCCCCCGG CGGGCCCGGT GGTGCCGGCG GGGCCGGTGG GCTGTTCAGC GTCGGTGGGG 120 

CCGGCGGCGC CGGCGGAATC GGATTGGTCG GGAACAGCGG TGCCGGGGGG TCCGGCGGGT 180

CCGCCCTGCT CTGGGGCGAC GGCGGTGCCG GCGGCGCGGG TGGGGTCGGG TCCACTACCG 240

GCGGTGCCGG CGGGGCGGGC GGCAACGCCA GCCTGCTGGT AA 282

(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

CGGCACGAGC CGTGCTACTG GTCAACTGAT GCCCTGATTG TGACCTTCCC GGCGCCGGAT 60 

CAGTGCTTCT CAGGACCGAC GTAATATTCG AAAACCAATC CGGCCGCCGA GGCGAGGATG 120 

AATGCCACAC CGGCGGCGAT CAGCCACGGG AGCCACAACG CGATGCCGAC CGCTGCCACC 180 

GAGCCGGACA ACGCGACCAT GATCGGCCAC CAGCTATGCG GACTGAAGAA TCCAAGTTCT 240 

CCTGCGCCGT CGCTGATTTC AGCGCCTTCG TAGTCCTCGG GCCGGGAATC TAACCGGCGG 300 

GCCACAAACC GGAAGAAGGT GGCGACGATC AACGCCATGC CGCCGGTGAG CGCCAACGCA 360 

ATGGTGCCAG CCCACTCGAC ACCACCGGTG GCGAACATCG AGGTCAACAC GCCGT 415 



(2) INFORMATION FOR SEQ ID NO:263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263:

TCACCGCGTG AACGGTTCGT AACACTGATA CGTATGCTTG TCAGCGAGCA GATCAAGTCC 60

AGTCCGACCA ATGCCAGGAG ATCATCGGCT AGGCTCACGG TTTCGCCTGG GACGAGACGG 120 

TATTGAGTTC TGGCGTTGGA CGGTCCGTGG CGTGGTGGGA AGTCTGACGC GGCATCAGAA 180 

CGGTTGTCAA TACCAGTCTT TGGGGGATAT GGCCTATTTG GTGTCGTCGG GCCGCTCCAC 240

CGGATCCCTT TTCGAACGTT GCGCAAGCGC GGTCCAGTTA CGGCCTGTTC ACTGCGCGCT 300 

GGCGTAGCTG CGCGGCCTCG ATCGGTTTGA ACGTCATCGC AATTCCCGCA ATGGGTGAGT 360 

ACCTGACGCT CCT 373 

(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264:

CCAAACCGGA CAGGCCGGCA GCGACGGTCG GAAGTTGCAC CACGGTGCGC GCTCCATGTA 60

GCCAACCGGT GACCACGGCG TAGACAGCAG ATCCGTGGAT CGCGCGTTCG GTGTCGTCCG 120

GGCCGAGTAC CCGCGGGCCG AACCGCAGCG ACCAAAGCAA CGCGATCGAT ACGGGGATCG 180

CCACTCGTGC CGAATTCGAG CTCCGTCGAC AAGCTTGCGG CCGCACTCGA ACCCGGGTGA 240

ATGATTGAGT TTAAACCGCT TAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG 300

TCTTGAGGGG TTTTTTGCTG AAAGGAGGAA CTATATCCGG ATAACCTGGC GTAGTAGCGA 360

AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGACGCG 420

CCC 423

(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 404 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 

AGTGGCCAGC CGGTCGGCCA ATGCATCCAG CTCCCGGTAC GTCAGCTGAC CATCCGCCCA 60

ACTGACCGCC ACCGAGTCAG GCTGTGCCGC AGCGATTTCG GCGAACCGGG TATGCACCGC 120

GGGTGCCGAC GTCGTCACAT CCGGCAGGCC GGGTGCGGTC GGATCGTGCT CGCCGTCCAG 180

CAGAATGTCG ACGTCGCGCA GCGGCCGATC CCACCGGCTG ACCAAGCGCT GTAACACAGC 240

CAGCACCCGC CTGCCGAGGC TTTCGGGCGC CATCGTGCCC AGCGCACCGT CGAGCACCTC 300

CACTAGCAGC GTGAGCTCAC CGGTGCTGCG GTGCGCGGCG ACGGTCACCG GAAAGTGCGA 360

CAAACTCTCT AGCGCCACCG GACGGAACGT CACCCCGTTT GCGA 404 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGGCTAACT TCGCACCCGG GTATCCGCCC 60 

ACCATCGAAC CCGCCCAACC GGCGGTGTCA CCGCCTACTT CGCAAGACCC GGCCGGTGCA 120 

GTGCGACCAC TGAGCGGCCA CCCCCGGGCG GCACTATTCG ACAACGGCAC CCGCCAATTG 180 

GTGGCTCTGC GCCCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 240 

ATGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 300

GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA CCTGTCCTCC 360 

GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 420 

C 421 

(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 426 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

ATGCATATCA CGCTCAACGC CATCCTGCGT GCGATCTTCG GGGCCGGCGG CAGTGAACTA 60

GACGAGCTGC GCCGCCTCAT TCCGCCGTGG GTCACGCTGG GCTCGCGCCT GGCGGCGCTA 120

CCGAAACCCA AACGCGACTA TGGCCGCCTT AGCCCGTGGG GCCGGCTGGC CGAGTGGCGG 180

CGCCAGTACG ACACTGTCAT CGACGAGCTC ATCGAAGCCG AGCGGGCCGA CCCGAACTTC 240

GCCGATCGGA CCGACGTTTT GGCGTTGATG CTGCGCAGCA CTTACGACGA CGGTTCCATC 300

ATGTCGCGCA AGGACATTGG CGACGAACTG CTCACGCTGC TTGCCGCCGG GCACGAAACC 360

ACGGCGGCGA CATGGGCTGG GCGTTCGAAC GGCTCAACCG GCACCCCGAC GTGCTCGCGG 420

CTCTGG 426

(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:268: 

GTCCTGGTCG CAGGCTGTTC TTCGAACCCG CTGGCTAACT TCGCACCCGG GTATCCGCCC 60 

ACCATCGAAC CCGCCCAACC GGCGGTGTCA CCGCCTACTT CGCAAGACCC GGCCGGTGCA 120 

GTGCGACCAC TGAGCGGCCA CCCCCGGGCG GCACTATTCG ACAACGGCAC CCGCCAATTG 180 

GTGGCTCTGC GCCCGGGCGC CGATTCGGCG GCACCCGCCA GCATCATGGT CTTCGATGAC 240

GTGCACGTTG CACCGCGCGT CATTTTTCTG CCGGGCCCGG CAGCCGCGTT GACCAGCGAC 300 

GACCACGGCA CGGCCTTCCT TGCCGCCCGC GGCGGCTACT TCGTGGCCGA CCTGTCCTCC 360 

GGTCACACCG CACGAGTGAA TGTCGCTGAC GCAGCGCACA CCGATTTCAC CGCGATCGCC 420
CGCCGCTCCG ACGGCAAGCT GGTGCTGGGC AGCGCAGATG GCGCCGTCTA CACGCTTGCC
AAGAACCCGC AGTTGACCGG CGTCGGCGCC GCCACCGTAG CC 

(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 739 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 

GCTGGGGCGC ACCGCCGTCC GGCGGCCCCA GCCCCTGGGC CCAGACCCCG CGCAAAACCA 
ACCCGTGGCC CTTAGTGGCC GGCGCCGCCG CCGTCGTGCT CGTCCTCGTG TTGGGCGCCA
TCGGCATCTG GATCGCCATC CGGCCCAAGC CGGTACAGCC GCCTCAGCCG GTTGCGGAGG
AGCGCCTTAG CGCCCTACTG CTGAACTCCT CAGAAGTCAA CGCCGTGATG GGCTCGTCGT
CCATGCAGCC GGGCAAACCG ATCACATCGA TGGACTCTTC GCCGGTGACG GTGTCCCTGC
CGGACTGCCA GGGCGCGCTG TATACCAGCC AGGATCCGGT GTATGCCGGC ACCGGCTACA
CCGCCATCAA CGGCTTGATT TCATCCGAGC CGGGCGACAA CTACGAACAT TGGGTGAACC
AAGCCGTCGT CGCCTTTCCG ACCGCCGACA AAGCCCGCGC GTTCGTGCAG ACTTCGGCCG
ACAAATGGAA GAACTGCGCA GGCAAGACGG TCACCGTCAC GAATAAGGCC AAGACCTACC
GGTGGACGTT TGCCGACGTC AAAGGCAGCC CGCCGACGAT CACGGTGATA GACACCCAAG
AAGGCGCTGA GGGCTGGGAA TGCCAACGCG CGATGAGCGT GGCCAACAAT GTGGTTGTCG 
ACGTCAACGC ATGCGGGTAC CAGATCACCA ATCAAGCAGG CCAGATCGCC GCCAAGATCT 
GTTGACAAAG TCAACAAGG 

(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

AGACGTCGTC GAGGCCGCCA TCGCCCGCGC CGAAGCCGTT AACCCGGCAC TGAACGCGTT 
GGCGTATGC 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271:

ACTGCACCCG GCAGGCGCGA CCAACGGATC GGGTCAACTA GCACTGCCGG TGGAGGCGCC 60

CCCGCGGTCT GTGCCTTCCC ACGGGGAACC CTTGGGCAGC GCGGCTCCAG AAGGGTTGGA 120

GGGAGAGTTC GACGACCGTA TCGACGAGCG GTTCCCGGTC TTCAGCTCGG CCAGTCTCGC 180

CGAAGCGCTG CCGGGTCCGC TGACCCCGAT GACGCTGGAT GTCCAGTTGA GTGGACTGCG 240

CGCGGCCGGT CGGGCGATGG GTCGGGTACT GGCGCTTGGC GGTGTCGTTG CCGATGAGTG 300

GGAGAGAAGA GCCATCGCGG TGTTCGGTCA CCGCCCGTAT ATCGGAGTGT CGGCCAATAT 360

TGTGGCCGCC GCCCAACTGC CGGGGTGGGA CGCGCAGGCC GTAACCCGGC GGGCACTGGG 420

CGAGCAACCG CAGGTCACTG AGCTGCTTCC GTTTGGTCGA CCGCAACTTG CGGGCGGACC 480

GCTCGGCTCG GTCGCGAAGG TGGTCGTGAC GGCACGGTCG CTG 523

(2) INFORMATION FOR SEQ ID NO: 272:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:272:

GTGTCGGTGT CGTCGGGGTA GGAGCGACTT CCCCGGCCGG CGCCGGCGCC GGAGCGGGCT 60

CTGCAGGAAC CGGTGCCGGC GCCGGCGGCG GGGCGACCAA AGGCCGGATC GATTCGGCCA 120

GCGCCTTGGC CGCGCCCTTG TCCACCGGGT TGTTGGCGGT CCCGAGCCAT ACCACAAACC 180

AACGCTGAAG GGGCCCGGCG TCCGGTGCGT TCGCCGCGGG CGAC 224

(2) INFORMATION FOR SEQ ID NO:273:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 521 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

TGAACTGACT GCCCCGCTCG ATCGGCGGCG GCGGCGTGTC ATAGCTGCGC CGCCAGGCCA 60 

TGAACTGCTC TTCGCCATAG CGGGCCTTGG TCTCGGCCTT GTCCAAACCC TGCAGCGCGC 120

CGTAGTGGCG TTCGTTGAGC CGCCAGCTAC GCCGCACGGG AATCCAGAGC CGATCGGCGC 180

TGTCCAACGC CAGATGCGCG GTGGTGATCG CGCGCCGCAG CAACGAGGTG TAGAGCACGT 240

CGGGCAATAG GTCGTGTTCC GCGATCAGCT CGCCGCTTCG AACCGCCTCT GCCTGGCCCT 300

TGTCCGTCAG GCCGACATCG ACCCAGCCGG TGAACAGGTT GAGGGCATTC CAGTCGCTCT 360

CGCCGTGGCG CAGCAACACC AGGCTGCCAG TGTTTGCCAT ACCGGCAAGT CTCTCACGCA 420

CTCCCGCACT CCTCATCGTG GACCAAAATG CCCGAATTCT CCTCGGTCCG CTGCGCAGCG 480 

CGTTCATACC GCCGAGGTGG TCGGCACCGT AACGGCCGGT T 521 

(2) INFORMATION FOR SEQ ID NO: 274: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274: 

CTCCAGGCTC ATTCGCTCGA ACAAAGCCAC CCGGCCGTAC AGCGGACGCC CCCATTCGTT 
GTCGTGATAG TCGCGGTACA GCTGGGCATC GGGCCCTGGA CGAACCTCCG CCCAGGGGCA 
GCGAACCAGC CCGTCGCCGC TCACGCGGGG TCAGAACGGT AGTGCACGAC AGTCTCGCCG 
CGCGAAGGGT TTGACGCGTC AGACTCGGCC TCGGCGTCTT CCGACGAGGC GTGGATCGCC 
CCGAGCTGAG AGCGTAGCGC CTCGAGCTCA CGGCCGAGCC GTTCCAGCAC CCAGTCCACC 
TCGCTGGTCT TGTTCCCGCG CAGCACCTGC GTGAACTTGA CCGCGTCGAC ATCGGCGCGG 
GTGACCCCGA ACGCCGGCAG CGTCGTCGCC GTCGTCGCCC GCGGCAGGGG CGGCAACTGC 
TCGCCA 

(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:275: 

GCGGACACGG CGGACAAAGC GCAATCGGCC TCGGCGGCGG CGCCGGCGGC GACGGGGGCC 
AGGGCGGCGC CGGCCGCGGA CTGTGGGGTA CTGGCGGCGC CGGCGGACAC GGCGGGGCAA 
GGCGGTGGTA CCGGGGGCCC ACCGCTGCCC GGTCAGGCAG GCATGGGCGC CGCGGGTGGC 
GCCGGTGGGC TGATCGGCAA CGGCGGGGCC GGCGGCGAC 

(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: 

AAGATCATCG GCGCCGCTCC TTAGCATCGC TGCGCTCTGC ATCGTCGCCG GCGCGGATCA 

CGGAGGTCCG GCCTTGTACC CCACTCCTCG AACGGTCAGC ACCACAGTCG GGTTCTCGGG 

ATCCTTTTCG ACCTTGGCCC GCAGACGCTG GACATGCACG TTCACCAGCC TGGTATCGGC 

TGGGTGCCGG TAACCCCATA CCTGTTCGAG CAGCACATCA CGAGTAAACA CCTGGCGCGG 

CTTGCGCGCC AATGCGACCA ACAGGTCGAA TTCCAGCGGT GTCAACGAGA TCTGCTCACC 

GTTGCGAGTG ACCTTGTGCG CCGGTACGTC GATTTCTACG TCGGCGATGG ACAGCATCTC 

GGCGGGTTCG TCGTCGTTGC GGCGCAGCCG CGCCCGCACC CGCGCAACCA GCTCCTTGGG 

CTTGAACGGC TTCATGATGT AGTCGTCGGC GCCCGACTCC AGACCCAGCA CCACATCCAC 

GGTGTCGGTC TTTGCGGTGA GCATCACGAT CGGAACACCG GAATCGGCGC GCAACACCCG 

GCACACGTCG ATGCCGTTCA TACCGGGGCA A

(2) INFORMATION FOR SEQ ID NO: 277:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 93 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 



Leu Phe Gly Ala Gly Gly Val Gly Gly Val Gly Gly Asp Gly Val Ala
1               5                   10                  15

Phe Leu Gly Thr Ala Pro Gly Gly Pro Gly Gly Ala Gly Gly Ala Gly
            20                  25                  30

Gly Leu Phe Ser Val Gly Gly Ala Gly Gly Ala Gly Gly Ile Gly Leu
        35                  40                  45

Val Gly Asn Ser Gly Ala Gly Gly Ser Gly Gly Ser Ala Leu Leu Trp
    50                  55                  60

Gly Asp Gly Gly Ala Gly Gly Ala Gly Gly Val Gly Ser Thr Thr Gly
65                  70                  75                  80

Gly Ala Gly Gly Ala Gly Gly Asn Ala Ser Leu Leu Val
                85                  90


(2) INFORMATION FOR SEQ ID NO: 278: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:278: 



Met Pro Pro Val Ser Ala Asn Ala Met Val Pro Ala His Ser Thr Pro
1               5                   10                  15

Pro Val Ala Asn Ile Glu Val Asn Thr Pro
            20                  25

(2) INFORMATION FOR SEQ ID NO: 279:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

Lys Pro Asp Arg Pro Ala Ala Thr Val Gly Ser Cys Thr Thr Val Arg
1               5                   10                  15

Ala Pro Cys Ser Gln Pro Val Thr Thr Ala
            20                  25

(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 

Trp Pro Ala Gly Arg Pro Met His Pro Ala Pro Gly Thr Ser Ala Asp
1               5                   10                  15

His Pro Pro Asn
            20

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281: 



Val Leu Val Ala Gly Cys Ser Ser Asn Pro Leu Ala Asn Phe Ala Pro
1               5                   10                  15

Gly Tyr Pro Pro Thr Ile Glu Pro Ala Gln Pro Ala Val Ser Pro Pro
            20                  25                  30

Thr Ser Gln Asp Pro Ala Gly Ala Val Arg Pro Leu Ser Gly His Pro
        35                  40                  45

Arg Ala Ala Leu Phe Asp Asn Gly Thr Arg Gln Leu Val Ala Leu Arg
    50                  55                  60

Pro Gly Ala Asp Ser Ala Ala Pro Ala Ser Ile Met Val Phe Asp Asp
65                  70                  75                  80

Met His Val Ala Pro Arg Val Ile Phe Leu Pro Gly Pro Ala Ala Ala
                85                  90                  95

Leu Thr Ser Asp Asp His Gly Thr Ala Phe Leu Ala Ala Arg Gly Gly
            100                 105                 110

Tyr Phe Val Ala Asp Leu Ser Ser Gly His Thr Ala Arg Val Asn Val
        115                 120                 125

Ala Asp Ala Ala His Thr Asp Phe Thr Ala Ile Ala
    130                 135                 140

(2) INFORMATION FOR SEQ ID NO: 282:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 



Met His Ile Thr Leu Asn Ala Ile Leu Arg Ala Ile Phe Gly Ala Gly
1               5                   10                  15

Gly Ser Glu Leu Asp Glu Leu Arg Arg Leu Ile Pro Pro Trp Val Thr
            20                  25                  30

Leu Gly Ser Arg Leu Ala Ala Leu Pro Lys Pro Lys Arg Asp Tyr Gly
        35                  40                  45

Arg Leu Ser Pro Trp Gly Arg Leu Ala Glu Trp Arg Arg Gln Tyr Asp
    50                  55                  60

Thr Val Ile Asp Glu Leu Ile Glu Ala Glu Arg Ala Asp Pro Asn Phe
65                  70                  75                  80

Ala Asp Arg Thr Asp Val Leu Ala Leu Met Leu Arg Ser Thr Tyr Asp
                85                  90                  95

Asp Gly Ser Ile Met Ser Arg Lys Asp Ile Gly Asp Glu Leu Leu Thr
            100                 105                 110

Leu Leu Ala Ala Gly His Glu Thr Thr Ala Ala Thr Trp Ala Gly Arg
        115                 120                 125

Ser Asn Gly Ser Thr Gly Thr Pro Thr Cys Ser Arg Leu Trp
    130                 135                 140



(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283: 



Val Leu Val Ala Gly Cys Ser Ser Asn Pro Leu Ala Asn Phe Ala Pro
1               5                   10                  15

Gly Tyr Pro Pro Thr Ile Glu Pro Ala Gln Pro Ala Val Ser Pro Pro
            20                  25                  30

Thr Ser Gln Asp Pro Ala Gly Ala Val Arg Pro Leu Ser Gly His Pro
        35                  40                  45

Arg Ala Ala Leu Phe Asp Asn Gly Thr Arg Gln Leu Val Ala Leu Arg
    50                  55                  60

Pro Gly Ala Asp Ser Ala Ala Pro Ala Ser Ile Met Val Phe Asp Asp
65                  70                  75                  80

Val His Val Ala Pro Arg Val Ile Phe Leu Pro Gly Pro Ala Ala Ala
                85                  90                  95

Leu Thr Ser Asp Asp His Gly Thr Ala Phe Leu Ala Ala Arg Gly Gly
            100                 105                 110

Tyr Phe Val Ala Asp Leu Ser Ser Gly His Thr Ala Arg Val Asn Val
        115                 120                 125

Ala Asp Ala Ala His Thr Asp Phe Thr Ala Ile Ala Arg Arg Ser Asp
    130                 135                 140

Gly Lys Leu Val Leu Gly Ser Ala Asp Gly Ala Val Tyr Thr Leu Ala
145                 150                 155                 160

Lys Asn Pro
160 



(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:284:



Trp 


Gly 


Ala 


Pro 


Pro 


Ser 


Gly 


Gly Pro 


Ser 


Pro 


Trp 


Ala 


Gin 


Thr 


Pro 


1 








5 










10 










15 




Arg 


Lys 


Thr 


Asn 


Pro 


Trp 


Pro 


Leu 


Val 


Ala 


Gly 


Ala 


Ala 


Ala 


Val 


Val 








20 










25 










30 






Leu Val Leu Val Leu Gly Ala Ile Gly Ile Trp Ile Ala Ile Arg Pro
        35                  40                  45

Lys Pro Val Gln Pro Pro Gln Pro Val Ala Glu Glu Arg Leu Ser Ala
    50                  55                  60

Leu Leu Leu Asn Ser Ser Glu Val Asn Ala Val Met Gly Ser Ser Ser
65                  70                  75                  80

Met Gln Pro Gly Lys Pro Ile Thr Ser Met Asp Ser Ser Pro Val Thr
                85                  90                  95

Val Ser Leu Pro Asp Cys Gln Gly Ala Leu Tyr Thr Ser Gln Asp Pro
            100                 105                 110

Val Tyr Ala Gly Thr Gly Tyr Thr Ala Ile Asn Gly Leu Ile Ser Ser
        115                 120                 125

Glu Pro Gly Asp Asn Tyr Glu His Trp Val Asn Gln Ala Val Val Ala
    130                 135                 140

Phe Pro Thr Ala Asp Lys Ala Arg Ala Phe Val Gln Thr Ser Ala Asp
145                 150                 155                 160

Lys Trp Lys Asn Cys Ala Gly Lys Thr Val Thr Val Thr Asn Lys Ala
                165                 170                 175

Lys Thr Tyr Arg Trp Thr Phe Ala Asp Val Lys Gly Ser Pro Pro Thr
            180                 185                 190

Ile Thr Val Ile Asp Thr Gln Glu Gly Ala Glu Gly Trp Glu Cys Gln
        195                 200                 205

Arg Ala Met Ser Val Ala Asn Asn Val Val Val Asp Val Asn Ala Cys
    210                 215                 220

Gly Tyr Gln Ile Thr Asn Gln Ala Gly Gln Ile Ala Ala Lys Ile Cys
225                 230                 235                 240



(2) INFORMATION FOR SEQ ID NO: 285:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

Asp Val Val Glu Ala Ala Ile Ala Arg Ala Glu Ala Val Asn Pro Ala
1               5                   10                  15

Leu Asn Ala Leu Ala Tyr
            20



(2) INFORMATION FOR SEQ ID NO: 286:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:286: 



Leu His Pro Ala Gly Ala Thr Asn Gly Ser Gly Gln Leu Ala Leu Pro
1               5                   10                  15

Val Glu Ala Pro Pro Arg Ser Val Pro Ser His Gly Glu Pro Leu Gly
            20                  25                  30

Ser Ala Ala Pro Glu Gly Leu Glu Gly Glu Phe Asp Asp Arg Ile Asp
        35                  40                  45

Glu Arg Phe Pro Val Phe Ser Ser Ala Ser Leu Ala Glu Ala Leu Pro
    50                  55                  60

Gly Pro Leu Thr Pro Met Thr Leu Asp Val Gln Leu Ser Gly Leu Arg
65                  70                  75                  80

Ala Ala Gly Arg Ala Met Gly Arg Val Leu Ala Leu Gly Gly Val Val
                85                  90                  95

Ala Asp Glu Trp Glu Arg Arg Ala Ile Ala Val Phe Gly His Arg Pro
            100                 105                 110

Tyr Ile Gly Val Ser Ala Asn Ile Val Ala Ala Ala Gln Leu Pro Gly
        115                 120                 125

Trp Asp Ala Gln Ala Val Thr Arg Arg Ala Leu Gly Glu Gln Pro Gln
    130                 135                 140

Val Thr Glu Leu Leu Pro Phe Gly Arg Pro Gln Leu Ala Gly Gly Pro
145                 150                 155                 160

Leu Gly Ser Val Ala Lys Val Val Val Thr Ala Arg Ser Leu
                165                 170



(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

Val Gly Val Val Gly Val Gly Ala Thr Ser Pro Ala Gly Ala Gly Ala
1               5                   10                  15

Gly Ala Gly Ser Ala Gly Thr Gly Ala Gly Ala Gly Gly Gly Ala Thr
            20                  25                  30

Lys Gly Arg Ile Asp Ser Ala Ser Ala Leu Ala Ala Pro Leu Ser Thr
        35                  40                  45

Gly Leu Leu Ala Val Pro Ser His Thr Thr Asn Gln Arg
    50                  55                  60

(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 133 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288: 



Met Ala Asn Thr Gly Ser Leu Val Leu Leu Arg His Gly Glu Ser Asp
1               5                   10                  15

Trp Asn Ala Leu Asn Leu Phe Thr Gly Trp Val Asp Val Gly Leu Thr
            20                  25                  30

Asp Lys Gly Gln Ala Glu Ala Val Arg Ser Gly Glu Leu Ile Ala Glu
        35                  40                  45

His Asp Leu Leu Pro Asp Val Leu Tyr Thr Ser Leu Leu Arg Arg Ala
    50                  55                  60

Ile Thr Thr Ala His Leu Ala Leu Asp Ser Ala Asp Arg Leu Trp Ile
65                  70                  75                  80

Pro Val Arg Arg Ser Trp Arg Leu Asn Glu Arg His Tyr Gly Ala Leu
                85                  90                  95

Gln Gly Leu Asp Lys Ala Glu Thr Lys Ala Arg Tyr Gly Glu Glu Gln
            100                 105                 110

Phe Met Ala Trp Arg Arg Ser Tyr Asp Thr Pro Pro Pro Pro Ile Glu
        115                 120                 125

Arg Gly Ser Gln Phe
    130



















(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289:

Pro Gly Ser Phe Ala Arg Thr Lys Pro Pro Gly Arg Thr Ala Asp Ala
1               5                   10                  15

Pro Ile Arg Cys Arg Asp Ser Arg Gly Thr Ala Gly His Arg Ala Leu
            20                  25                  30

Asp Glu Pro Pro Pro Arg Gly Ser Glu Pro Ala Arg Arg Arg Ser Arg
        35                  40                  45

Gly Val Arg Thr Val Val His Asp Ser Leu Ala Ala Arg Arg Val
    50                  55                  60



(2) INFORMATION FOR SEQ ID NO: 290:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 72 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

Gly His Gly Gly Gln Ser Ala Ile Gly Leu Gly Gly Gly Ala Gly Gly
1               5                   10                  15

Asp Gly Gly Gln Gly Gly Ala Gly Arg Gly Leu Trp Gly Thr Gly Gly
            20                  25                  30

Ala Gly Gly His Gly Gly Ala Arg Arg Trp Tyr Arg Gly Pro Thr Ala
        35                  40                  45

Ala Arg Ser Gly Arg His Gly Arg Arg Gly Trp Arg Arg Trp Ala Asp
    50                  55                  60

Arg Gln Arg Arg Gly Arg Arg Arg
65                  70



(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291:

Asp His Arg Arg Arg Ser Leu Ala Ser Leu Arg Ser Ala Ser Ser Pro
1               5                   10                  15

Ala Arg Ile Thr Glu Val Arg Pro Cys Thr Pro Leu Leu Glu Arg Ser
            20                  25                  30

Ala Pro Gln Ser Gly Ser Arg Asp Pro Phe Arg Pro Trp Pro Ala Asp
        35                  40                  45

Ala Gly His Ala Arg Ser Pro Ala Trp Tyr Arg Leu Gly Ala Gly Asn
    50                  55                  60

Pro Ile Pro Val Arg Ala Ala His His Glu
65                  70

(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292: 



CCGCACGTAA CACCGTGAAT TGAAGGGAGC CGCTGGTCAT GGGCCGATTC TATCCGTGGG 60

CGAACGGTTA TTGACGGCCC GGAGGCCACT CCGCTGCCAC CAAGTGGTGA CTCAGCGCGT 120

TTTCACGGCA ACGAACGGCG GACACACCAC TTGACATTCG ACAGCACGGC CGCG 174



(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

TCGCAAACGG GGTGACGTTC CGTCCGGTGG CGCTAGAGAG TTTGTCGCAC TTTCCGGTGA 
CCGTCGCCGC GCACCGCAGC ACCGGTGAGC TCACGCTGCT AGTGGAGGTG CTCGACGGTG 
CGCTGGGCAC GATGGCGCCC GAAAGCCTCG GCAGGCGGGT GCTGGCTGTG TTACAGCGCT 
TGGTCAGCCG GTGGGATCGG CCGCTGCGCG ACGTCGACAT TCTGCTGGAC GGCGAGCACG 
ATCCGACCGC ACCCGGCCTG CCGGATGTGA CGACGTCGGC ACCCGCGGTG CATACCCGGT 
TCGCCGAAAT CGCTGCGGCA CAGCCTGACT CGGTGGCGGT CAGTTGGGCG GATGGTCAGC 
TGACGTACCG GGAGCTGGAT GCATTGGCCG ACCGGCTGGC CACT 

(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: 

Ala Asn Gly Val Thr Phe Arg Pro Val Ala Leu Glu Ser Leu Ser His
1               5                   10                  15

Phe Pro Val Thr Val Ala Ala His Arg Ser Thr Gly Glu Leu Thr Leu
            20                  25                  30

Leu Val Glu Val Leu Asp Gly Ala Leu Gly Thr Met Ala Pro Glu Ser
        35                  40                  45

Leu Gly Arg Arg Val Leu Ala Val Leu Gln Arg Leu Val Ser Arg Trp
    50                  55                  60

Asp Arg Pro Leu Arg Asp Val Asp Ile Leu Leu Asp Gly Glu His Asp
65                  70                  75                  80

Pro Thr Ala Pro Gly Leu Pro Asp Val Thr Thr Ser Ala Pro Ala Val
                85                  90                  95

His Thr Arg Phe Ala Glu Ile Ala Ala Ala Gln Pro Asp Ser Val Ala
            100                 105                 110

Val Ser Trp Ala Asp Gly Gln Leu Thr Tyr Arg Glu Leu Asp Ala Leu
        115                 120                 125

Ala Asp Arg Leu Ala Thr
    130













(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 526 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295:

GCTTCGACGG CTACGAGTAC CTGTTCTGGG TGGGTTGTGC GGGCGCCTAC GACGACAAGG 60

CCAAGAAGAC CACCAAGGCC GTCGCCGAGC TGTTCGCCGT CGCCGGGGTG AAATACTTGG 120

TGCTGGGCGC TGGGGAAACC TGCAACGGCG ACTCGGCGCG CCGCTCCGGC AACGAGTTCC 180

TCTTCCAGCA GCTGGCACAA CAGGCCGTCG AGACCCTGGA CGGTTTGTTC GAGGGTGTGG 240

AGACCGTCGA CCGCAAGATC GTTGTCACCT GCCCGCACTG CTTCAACACC ATCGGCAAGG 300

AATATCGGCA GCTGGGCGCC AACTACACCG TGCTGCACCA CACCCAGCTG CTCAATCGGT 360

TGGTGCGCGA CAAGAGGCTG GTCCCTGTCA CTCCGGTTTC TCAGGACATC ACCTACCACG 420

ACCCGTGCTA CCTGGGTCGG CACAACAAGG TCTACGAGGC ACCACGGGAG CTGATCGGTG 480

CCGCGGGGGC CACCTGAGCC GAGATGCCGC GCCATGCCGA CCGCAG 526

(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 487 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:

CTCGCCGCCG TGATCTGGCC GGCGAACTTC GTCAGTGCAT CCAGACCCCA ACGATCATCG 60

ATCAGGCCGA TGCCCATGAT CACCGCACCG GCCACCAGCA CCGCGGGCAT GCCGGTGGAA 120

TAGACGAACC CCCGGGTGAG TGCCGGAAGC TGGGAGGCAA GAAAGACGGC GCCGACAATG 180

CCCAGGAACA TCGCCAACCC ACCCATCCGA GGGGTAGGCG TGACGTGCAC ATCTCGCTCC 240

CGCGGGTAGG CGACGGCTCC CAGGCGACTG GCCAGCATCC GCACCGGACC GGTCGCAAAA 300

TAGGTGATGA TCGCCGCGGT CAGCCCGACC AGCGCAAGCT CACGCAGCGG GACACCGGCG 360

CCGCGATAGG ACAGGGCGAG CAAGCCACCG GCAACGCCGG CCACATCGCT GGACACCTCG 420

AGACCGTACT GCACCAACCT GAAGAGCTGA ACACTCGCCG AACGTGCAAC AGCTGCGAAC 480

AATTGGG 487

(2) INFORMATION FOR SEQ ID NO: 297:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 528 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:

ACGAAGCGCG AGAATATGAG CCGGGGCAAC CCGGCATGTA CGAGCTTGAG TTCCCGGCGC 60

CTCAGCTGTC GTCGTCCGAC GGCCGTGGTC CGGTGTTGGT GCACGCTTTG GAAGGTTTCT 120

CCGACGCCGG CCATGCGATC CGGCTGGCCG CCGCCCACCT CAAGGCGGCC CTGGACACAG 180

AGCTGGTCGC GTCCTTCGCG ATCGATGAAC TACTGGACTA CCGCTCGCGG CGGCCATTAA 240

TGACTTTCAA GACCGATCAT TTCACCCACT CCGATGATCC TGAGCTAAGC CTGTATGCGC 300

TGCGCGACAG CATCGGCACC CCATTTCTGC TGCTGGCGGG TTTGGAGCCG GACCTGAAGT 360

GGGAGCGGTT CATCACCGCC GTCCGATTGC TGGCCGAGCG CCTGGGTGTA CGGCAGAACC 420

ATCGGCCTGG GCACCGTCCC GATGGCCGTT CCGCACACAC GACCGATCAC GATGACCGCT 480

CATTCCAACA ACCGGGAGCT ATCTCCGATT TTCAACCGTT CGATCTCC 528

(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 610 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:

CCAAGCCCGT CAAGGAGCCG GTGCCGGCCT TGCCTCCGGT GCCGCCGACG CCGGCGTTGC 60

CGCCGTTGCC GCCGTTGCCG CCGGTACCGG GGTTTCCTAC GGTGCCGCCG CCCGGCAGCA 120

TGGCCCCGCT GTTTAGGCCG TTTTCGCCGG CCCCGCCGTC ACCGGCTTTG CCGCCATCGC 180

CGCCGTTGCC GCCGCTGGTG GGGGTGGCGG CCTGGTTGAC GTATTGTTCC ACCGGCCCGG 240

CCCTTGACCC TTTGGCGGTG TCGATCGCGG CGTCGATGGA TCCGCCGACC ACGACGTGCG 300

AAGCCTCGCC TGCCGCCGCA GCCGCCCAAC TGTGTCGCGG CTCCTGCGAT TTGGCCCCGG 360

CCGACGAGAT GATGGGCACC ACCGGAGCCT GCGGCCGTCT GGGGGAGGCC AGCGCGGGTT 420

CGCGGTCACG CCATACGCGA CGGTGCGCCG CCGCTTCGGA GATTTGCAGG CTGCGTTGCA 480

CCAGATCGAG CAGCGGTGTG CCCAGGGACT GGGTTAGCCC GTTGGCGCCG CCGTTGTAGC 540

GGCGAGCGCA ATATCGGTGC CCACTCGACC CAACCGCGAC TCCATAAGCG ACACCATTCG 600

CGGTTGATGC 610



(2) INFORMATION FOR SEQ ID NO: 299:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 164 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:





Phe Asp Gly Tyr Glu Tyr Leu Phe Trp Val Gly Cys Ala Gly Ala Tyr
1               5                   10                  15

Asp Asp Lys Ala Lys Lys Thr Thr Lys Ala Val Ala Glu Leu Phe Ala
            20                  25                  30

Val Ala Gly Val Lys Tyr Leu Val Leu Gly Ala Gly Glu Thr Cys Asn
        35                  40                  45

Gly Asp Ser Ala Arg Arg Ser Gly Asn Glu Phe Leu Phe Gln Gln Leu
    50                  55                  60

Ala Gln Gln Ala Val Glu Thr Leu Asp Gly Leu Phe Glu Gly Val Glu
65                  70                  75                  80

Thr Val Asp Arg Lys Ile Val Val Thr Cys Pro His Cys Phe Asn Thr
                85                  90                  95

Ile Gly Lys Glu Tyr Arg Gln Leu Gly Ala Asn Tyr Thr Val Leu His
            100                 105                 110

His Thr Gln Leu Leu Asn Arg Leu Val Arg Asp Lys Arg Leu Val Pro
        115                 120                 125

Val Thr Pro Val Ser Gln Asp Ile Thr Tyr His Asp Pro Cys Tyr Leu
    130                 135                 140

Gly Arg His Asn Lys Val Tyr Glu Ala Pro Arg Glu Leu Ile Gly Ala
145                 150                 155                 160

Ala Gly Ala Thr

























(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 161 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:



Arg Arg Arg Asp Leu Ala Gly Glu Leu Arg Gln Cys Ile Gln Thr Pro
1               5                   10                  15

Thr Ile Ile Asp Gln Ala Asp Ala His Asp His Arg Thr Gly His Gln
            20                  25                  30

His Arg Gly His Ala Gly Gly Ile Asp Glu Pro Pro Gly Glu Cys Arg
        35                  40                  45

Lys Leu Gly Gly Lys Lys Asp Gly Ala Asp Asn Ala Gln Glu His Arg
    50                  55                  60

Gln Pro Thr His Pro Arg Gly Arg Arg Asp Val His Ile Ser Leu Pro
65                  70                  75                  80

Arg Val Gly Asp Gly Ser Gln Ala Thr Gly Gln His Pro His Arg Thr
                85                  90                  95

Gly Arg Lys Ile Gly Asp Asp Arg Arg Gly Gln Pro Asp Gln Arg Lys
            100                 105                 110

Leu Thr Gln Arg Asp Thr Gly Ala Ala Ile Gly Gln Gly Glu Gln Ala
        115                 120                 125

Thr Gly Asn Ala Gly His Ile Ala Gly His Leu Glu Thr Val Leu His
    130                 135                 140

Gln Pro Glu Glu Leu Asn Thr Arg Arg Thr Cys Asn Ser Cys Glu Gln
145                 150                 155                 160

Leu



(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 175 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:










Glu Ala Arg Glu Tyr Glu Pro Gly Gln Pro Gly Met Tyr Glu Leu Glu
1               5                   10                  15

Phe Pro Ala Pro Gln Leu Ser Ser Ser Asp Gly Arg Gly Pro Val Leu
            20                  25                  30

Val His Ala Leu Glu Gly Phe Ser Asp Ala Gly His Ala Ile Arg Leu
        35                  40                  45

Ala Ala Ala His Leu Lys Ala Ala Leu Asp Thr Glu Leu Val Ala Ser
    50                  55                  60

Phe Ala Ile Asp Glu Leu Leu Asp Tyr Arg Ser Arg Arg Pro Leu Met
65                  70                  75                  80

Thr Phe Lys Thr Asp His Phe Thr His Ser Asp Asp Pro Glu Leu Ser
                85                  90                  95

Leu Tyr Ala Leu Arg Asp Ser Ile Gly Thr Pro Phe Leu Leu Leu Ala
            100                 105                 110

Gly Leu Glu Pro Asp Leu Lys Trp Glu Arg Phe Ile Thr Ala Val Arg
        115                 120                 125

Leu Leu Ala Glu Arg Leu Gly Val Arg Gln Asn His Arg Pro Gly His
    130                 135                 140

Arg Pro Asp Gly Arg Ser Ala His Thr Thr Asp His Asp Asp Arg Ser
145                 150                 155                 160

Phe Gln Gln Pro Gly Ala Ile Ser Asp Phe Gln Pro Phe Asp Leu
                165                 170                 175





(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO:302: 



Lys Pro Val Lys Glu Pro Val Pro Ala Leu Pro Pro Val Pro Pro Thr
1               5                   10                  15

Pro Ala Leu Pro Pro Leu Pro Pro Leu Pro Pro Val Pro Gly Phe Pro
            20                  25                  30

Thr Val Pro Pro Pro Gly Ser Met Ala Pro Leu Phe Arg Pro Phe Ser
        35                  40                  45

Pro Ala Pro Pro Ser Pro Ala Leu Pro Pro Ser Pro Pro Leu Pro Pro
    50                  55                  60

Leu Val Gly Val Ala Ala Trp Leu Thr Tyr Cys Ser Thr Gly Pro Ala
65                  70                  75                  80

Leu Asp Pro Leu Ala Val Ser Ile Ala Ala Ser Met Asp Pro Pro Thr
                85                  90                  95

Thr Thr Cys Glu Ala Ser Pro Ala Ala Ala Ala Ala Gln Leu Cys Arg
            100                 105                 110

Gly Ser Cys Asp Leu Ala Pro Ala Asp Glu Met Met Gly Thr Thr Gly
        115                 120                 125

Ala Cys Gly Arg Leu Gly Glu Ala Ser Ala Gly Ser Arg Ser Arg His
    130                 135                 140

Thr Arg Arg Cys Ala Ala Ala Ser Glu Ile Cys Arg Leu Arg Cys Thr
145                 150                 155                 160

Arg Ser Ser Ser Gly Val Pro Arg Asp Trp Val Ser Pro Leu Ala Pro
                165                 170                 175

Pro Leu

(2) INFORMATION FOR SEQ ID NO: 303:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 921 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 



AATTCGGCAC GARCAGCACC AACACCGGCT TCTTCAACTC CGGCGACGTC AATACCGGTA 60 

TCGGCAACAC CGGCAGCTTC AACACCGGCA GCTTCAATCC GGGCGATTCC AACACCGGGG 120 

ATTTCAACCC ANGCAGCTAC CACACGGGGA CTCGGAAACA CCGGCGATTT TACACCGGCS 180 

CCTTCATCTC CGGCAGCTAC AGCAACGGGT CTTGTGGAGT GGAAATTATC AGGGCTCATT 240

GGNTGCACCC GGSCTTRCGA ATCCCTCGKG CCAATTCAAC TCCTCNACAA GCTTGCGGCC 300

GCACTCSAGC CCGGGTGAAT GATTGAGTTT AACCGCTNAN CAATAACTAG CATAACCCCT 360 

TKGGGCCTCT AAACGGGTCT TGAAGGGTTT TTTGCTGAAA GGANGAACTA TATCCGGATA 420 

ACTGGCGTAN TACGAAAAGC CGCACCGATC GCCTTCCCAA CAGTTGCGCA CCKGAATGGC 480 

AATGGACCNC CCTKTTACCG GSCATTAACN CGGGGGTGTN GGKGTTACCC CCACGTNACC 540

GCTACCTTGC CANNSSCCTN RSGCCGTCTT TCSTTTCTTC CTTCCTTCTC CCMCTTCGCC 600 

GGTTCCCNTC AGCTCTAAAT CGGGGNNCCC TTTMGGGTTC CAATTATTGC TTACNGSCCC 660 

CCACCCCAAA AAYTNATTNG GGTTAATGTC CCTTMTTGGG CNTCCCCCTA WTNANNGTTT 720

TCCCCCTTNA CTTTGRSTCC CTTCYTTATW NTGAMNCTNT TTCCACYGGA AAAMNCTCCA 780

CCNTTYSSGS TTTCCTTTGA WTTATMRGGR AATTSCAATY CCGCYTTKGG TTMAANTTAA 840

CYTATTTCNA ATTTTCCCGM TTTTMMNATR TTNSNCKCGM KNCTCCNRKA SSGNTTTCCT 900

CCCCCYTTSS GKTYCCCCRN G 921

(2) INFORMATION FOR SEQ ID NO: 304:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: 

AATTCGGCAC GAGATANGGG CGCACCGGGG TCCGCAGCCG GCGGGACCGT CGCCAGCACC 

ACCGGGGTCA ACAGCACCAC GGTGGCGTCC ANGCAGAGCG CCGCGGTGAT GGCGGCCGAG 

ACGGCRAACA CCTGCCGTAG CAGTCGGTGC GACTCCGCGC TCGCTCGANC CATGGCCGCG 

CCGGCTGCCT CGAACANGCC TTCGTCGTCC ACAGCTTAGC CAGCANCCAA ACCGCACCCA 

GAAACCCACA CGCCCGCCGC CCCGGANACC TGCGCCATCG KCTGCTGGGG CGANATCCCC 

CGATCGCTNA CANGATGACC GCTGCCGGAA CGCCGCCGCT GCCTCCGGGC AGCCGCGTGG 

GCSGGGCAAC CGCGAACCCA NGAACACGGC AAGCAGTATC ANCGCAACAG CAATTGTCAA 

GGGCTAAACG CTTCACATCC AGGGATCTCG CGGCGCCACA CCGTCGGMTC TGCAGSGCGA

CCCCNTCCTN GGGCGGNCAC TCNTCAAAGA TGCNGATCNA CAGKCTAGGT CTTCGGCCGA 
TATGSAAGGN CCCAACGGNT TTAAAGCGGC SAAAAAASTC TCCCANTGGA TAAAATCAGC 

CGGGGANCCC CCCGTGSCMM NGTCYCGGKC ATTNTTCAAC MGGTTTNACG GCGGKTGCNG 

GCCAACTKGC CAAAMTTAAG KTNGGGGNTY CGGGGCGGTA ACCGGCNNTK NGCCCCTTAA 

AAAACCGGNC YTTTCTKGAT TAMMACCGGN CCCCCAWTGG CGGKTGKTCC CANGNTYAAC

AMCCYCCCSS MNGGGKTGGS SAACCCTTCC CGNGGGGTTC NTKGTTSCYT AWMCCCCCGG 

AAACCSGKYG GGKTGGCRTN WASSAMNCCC CMNGYYTCTT TAAAGGCCAN KNRAAWGKYT

CCTTGGGAAW CCTNCAATYC GAAAAYYCTC CTYMMGSSCN CTTKCWRTYN NRNGGGAACS 

AMWTNYCCNC GWTTCAWTCG GGTCCGASMN AAACKCTTTY TTTTYCGSSC STCCMGGSNC 

SGGTKNANAN AAASATTTMC YYCNNNANKK YYYCSSGCTT CYKMGRRNRR GMGAACCCGR
GS 

(2) INFORMATION FOR SEQ ID NO: 305:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

AATTGGCACG AGTGATCGCG CTGAAGCCGG TAGCGCGGGT GGCTCGGGTG GTTTGCGAAC 
RAAATCCGCT CGANGTGGTC TCGGTAGGCG GTGTCCANAA CGGTGGCGCG GTGCCGGCGG 
ATCTGATCGG CGCGGCCGTA GTGCACGTCG GCGGGCGTGT GCAGTCCGAT GCCGGAATGC 
TTGTGTTCGT GGTTGTACCA GCCGAAGAAC CGGTCGCAGT GCACCCGGGC CGCCTCGATC 
GACTCGAACC GTTTCGGGAA ATCGGGCCGG TACTTGAAGG TCTYGAACTG GGCCTCAGAC 
AACGGGTTGT CTTGCTGGTG TGCGGGCGTG AGTGCGACTT GGTGACACCG AAGTCGGCCA 
NCANCAATGC CACCGGTTTG GAACTCATCC ACAACCCCCG TCCGCGTCMA GGTCACTTGT 
NCGGCGCTAA TTTNYTGGGC GGCAAGGGTT TGCCGAYCAN KCCGCTCGGC CAAAACTTCG 
ANTCNCSCCA AGGCCNCCAT CCNCCCAAAC AMGTTACGGG ANAAAANATY CAAAGAYCAC 
CYTCCGGKTN TTATANCTYC CCYTTTGSTY GGGCCCCCCN CYYTGKKNAT ACCCCTNCCA
AWTCCCAACN CCCKCCAANA RCYKGGGGCC CCCNCCAACC CGGGKGAAKA WTAATTTAAA 660

CCCYAACMAW ACTWMMNACC CNNGGGSCCY AAMCGTYYNR AGGTTTTSCT NAAAGAAASA 720

ANTCGGAAMC CGGNTSTACC AAAAASCCCK CCNWTCCCTC CRASATTGSC NCCSAAWKSA 780

AKGCCCCCNY TCSGCNWNNC CSGCGGKKKT KKGTTNCCCT WMRCWMWYTS GGCCNASCCN 840

CKYYSSMYCC CCCCTCCCCM CTCCGNKTCC CCAMCCYANC MGGCCCCYTM GKKCCCWKNT 900

YKGCCCCCCC AMMNNNGGGG WGACCCTNGG CCCCMKRRGM TCCCNANTGA MCCTCWGNRA 960

MKCYCCNRAR ANMCCSCNCC NGCNCRCKNN 990

(2) INFORMATION FOR SEQ ID NO: 306: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306:

AATTCGGGTG GCAACGCGGG CCTGTTCGGC AACGGCGGCG CCGGTGGTGC CGGTGGGGCT 60

GGTGGTGGCG CCGGCGGCGC GGGCGGTAAC GCGGGGTGGT TTGGTCATGG GGGCGCTGGC 120

GGCGTGGGTG GTGTANGTGC GGCCGGGGCC AACGGTGCTA CGCCCGGTCA GGATGGGGCG 180

GCTGGTGTTG CCGGGTCGGA CRACRCTCGT GCCGCTCGTG CCG 223

(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 418 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:

AATTCGGCAC GANGCGGCAA CGGTGGCAGC GGCGGCACGT CNGTTGCCAC CGGGGGGGCC 60 

GGGAACGGCG GTGCCGGCGG CGCCGGCGGC GGGGCCGGGC TGATCGGCAA CGGCSGCAAC 120

GGCGGCAGTG GCGGAATGGG CGATGCCCCG GGCGGCACCG GCGTCNGCGG CATCRGTGGG 180 

CTGTTGTTGG GTTTGGACRG CGCCAACGCC CCGGCCAGCA CCAACCCGCT GCACACCGCG 240 

CAGCACAGGC GTTGGCCGCA GTCAACGCGC CCATCCAGGC CGTGACCGGG CGCCCCTGAT 300 

CGGCAACGCG CCAACGGCGC CCCGGGCAAC GGGGCCCCCG GCRGGCACGG CGGGTGGTTG 360

TTCGGCGGCG GAAGGAACGG CGGGTCCGGC GTCANCRGCG GGGCGGGCGG AAATGCCG 418 

(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1049 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1036 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:310: 

AATTCGGCAC GAGATCATGA ATAGCGGGCT GGTCAGCACC GAAGTGGTCG GCGATCTCGC 60 

GAGCAAGTCT CGTCTGCTCG CCCAGCAGGA GGTCGGCATC GATGCGGACA CCTGCGATGT 120 

CTTGGATGGT GTTCAGTTGC AGGTAAGGCC GACGCCGCAG CTTTGCTAGC AGGGTGTCTT 180 

GGCTCTTCGC ACGTGAGGTA ACCAATAACT CCGACGCAGA CCAACTCCGG CCCTCGATCC 240 

GGGTACCAGG CTCCGCCGGA GCCAGCCGTT GTGCCCCCTG GGCCGAAGGT CAGCTGCTGT 300

GCGATCGAAG TAAGAAACCG CGCCATGCCC GTCGCCAAGT ACGACTGACC GAGCAAACGA 360

ACGATCGTCG TCCTTTCCGT SSSGGTAATC GAMGCCAGCA ACCGCACGAG CCACCAATCA 420

TTGGGATTCG GCCACTGACC GACCAACCGC CTGTGCGACA CCCCAGCGGA ATTGGTGGTC 480

TTCCGCGGGG CCGCNAACGG AATCANCGSG ACGCGCTCGC CGAASCANCC GCATANCCNT 540

ACATANCAAC GGNNTCTGCG CCCACATTTC GGGSTTMTGC CCCTCNGCAA CSSNAAYNCC 600

CCCAATTCYG AACNAAAAAA TTGGYCCATY ARNGTYCTCM CCAAAAACCN AWTCCCCKTA 660

TCCCCCGGGG GGGRCCCCYY NMNAAAACGG CCCWWAANCC CCSGGGCSCC CGGGTTRWTN 720

CCCCTTGTCG GCCCNCCSGG TTTGGTCMCM GGSCMMTNWN GGGNTGCSCC CCCNCNAAAA 780

AAAAAYCKNG NCAAATYAAA CCCKYCMAAA ASKTGGGSSC CCCMARCCGG GGKAAKKWWA 840

ANTTAANCCN KAAAAAAAWW NCANNMCCCC NGGGNCCTAA GGKYTTAGGG GTTSTTNANG 900

ARAAAATMTC CANATMNSSK TTNNAAAAAA ASCCSWAKCC CCCNNNKKNN CCAAWKAARR 960

SRCCTTCGGG TNWNSGGGGG KKKKKTNCMS KMNMMTTWGR CCCNCCGCCN NNTWKCCTTN 1020

TCCNYGGNGC RNCAGN 1036

(2) INFORMATION FOR SEQ ID NO: 311:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1060 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:

AATTCGGCAC GAGTCGATTC GATCGAACAC GCCCGCACCT GGCCAGGCCA CATGGGCGCG 60 

GCCATGGCCA ACGCCTACTC GGCCAACCCG AATCCATTCG GCGTCTCACC GCAACCCCCG 120

AAACCGGCGA CCGCGGCATG GATCAACCCG CCCACCCCAG ATCCGAAATA GCGTCCACAT 180

AATGAGACAC TGGCGCAAAG AGCTTGACAG GCGCCGCACC ACGCAAGCTG TTAGACGTGT 240

CGGTCTTGCA AGAAGCGGGT TGGCCACCCA AGATCACGCC GCCCAAGGGC ATCGAGTCAA 300

CGTTGCGGTG GTATCGCGCT AACGTCGGCG CCGCCAAGAA ATGACGGTGC GCATTACCAT 360

GGCCCTGCTG ATCACCTTTG GCCACCTGCG CACCANAACT ATGANCAGCC TTATGCCGAG 420

TCTCGTGGAC ATCGGCAGCC GCTTCAAAAA CTCCTTGTCG ACAATSGTAT TGCTGANCCG 480

CCGAATTCTT NTRCTTGCAA SAACACTNCA TGTTNCSGGT NAACAACCYT GGTTNGAAAA 540

ACANCCAATA TTGAANTCCC ANTCGGGCAM GAACCNGTTM CGGAAGKTGK TGGGAACGAA 600

TGKTGCCCAA AAATCCCGGG NGGTRAAAWW CCCNSNATGG MSAATTTTSC CTNGAACAAM 660

AAAAGGTCCA AGKYCAAAGG NGCCCCCCCC SGNAAATTGG TGAACSCAKA WYANRTTCCC 720

WWWTNCAAAT MTTNGGGTCC KNNTCCCCWT AAANGGGSCN CCCCNCCRGG GMGTYTCCCC 780

NWNMGGGMGN CYYCSCCCCA AAAAAAAMMM MTTTCSGKGG SMGGKKCCCC CCSGGTYWGG 840

GKKYTTAAAC CCGGKGGGTN CAAAAAANAN ACCCCCCAMS NGGGGGGAAA ATTTGNAAWT 900

AAGGKKKTKC SCMACCCCAA AAANMMNNCN AWNCCCGMGK SARGGGGRNY TTMKAGGGMG 960

GNYCCCCCCW YCGGGGGGNA NAAYAAAAGK NGSNGRGAAT NTTNTTTTGK RSSSRNKTTT 1020

TYNTCCTYCN CCNMGNRWWG SRAMNTGKTS NSSGGGSGGC 1060



(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 



AATTCGGCAC GAGCTTCACC AAAGAGCTGA CATGCCGGGT GATGCGACAT CGCATCGAGG 60

GCAATACGGG CATGGATGAN CCGAANGGAN TCTGGCGTTC GCTCAACTGG ATTACGGTTC 120

CCAAGGTGAA ACGCTTTGCG GCGAAAGATG CGACGCTTAA CTTGCGCTTC CACCGTGCAA 180

TGTTNGTATG GATGCTGGAA CCGCGCTGAC NGATAANGAA TTCGCTGGTC GCCGGGCACN 240

ATGGATGGTC CKSTTTTCNC TCCGCSGTTA AATTGCSTGT GCATCATCTG GCAGGCTATG 300

TTCCCGCTAC RCTGCAGCCC ATCATGGATG TGCGGCTAAC GAANAAGTTA TGACATGGCG 360

CAAGCGAMTC GGGCATSCNC GCGGCAMTTT CGCAACCTGC TGTGTNTGAA GCGTMTCAAC 420

CGAATGCGGC GCTYAAAAGC NGGCTTGCGT TGATTMMAAC CNAACCCNTN CNATYCTTTG 480

CCGNGNMNTG CGTTCTCTCC AACTCCGKKG SYTGCCNCCG TGAAACCCMA CTNCCCCCCC 540

GTTGGACTTA MRTNTTCAAA AAMCGGMTNA ACCSGAATNN SAACCTNCCR TCAAANTAMM 600

SAANTCGGGC TTYGGGNRCC CCCCNGAAYW TTCKNCNGGG GMNNTYCTCN GGTTYNGGCG 660

SAAACNTTTG CCRTNCYMNN TTTACAMGGC NCMTNMTTGM GGGSCSNNAS GWCCCGGGKK 720

TNTTTNCAAW TCNCNSKTTT TTKGGGGGGG GGCYGRTRMC NCGGGCCCCC GGCCCKKMAA 780

AAAAAMCMSA RRCCNCYGGG KKCCCCCCCM NNATNGGGCG YKCRAAACAA ACCCCAANRA 840

TNGNGMGGGC SMACCSGNGN GYNAAAKGGT TSNSCTMANM MKGMANNNCT SGMSCCMNSN 900

NCTGMGGGKT TTKGNNGARN AANAMKMGGM RCGGNCGCNN GAAAGGGSMS GSCKSCNNGN 960

NGASNGWMGN CRNNGANRCC NCNGYGNMRN NNGNNNGNNN GGGRKNNACN NMKMCAWSMC 1020

NSNMMGNNNS CGYMTNKCGC 1040

(2) INFORMATION FOR SEQ ID NO: 313: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:313: 

AATTCGGCAC GAGACAANGG CGTGAAATGG GATCCGGCCG AGCTGGGGCC CGTCGTCAGC 60

GACCTGTTGG CCAAGTCGCG GCCGCCGGTT CCGGTCTATG GGGCCTAGTT ATCTGCGCCG 120 

AGCGTGAACT CAGGGCGAGA TTTCGGCCGT TTTCTCGCCC TGGCTTCACG TTCGGCGAAG 180 

TKGGGAACGG TCAGGGTTCG CAAACCACGA TCGGGATCGT GCGGTCGGTC CAGGACTGGT 240

ANTCCTGATA CTTKGGTACA TCGTGACCAA CTGTGGNCAA TATTCGGCGC GCTCCTCGTC 300 

NGTCGCGTCC CGCGCGGTAA GGTCCANCAC TTCCTTTTTC TCGTGCCG 348 



(2) INFORMATION FOR SEQ ID NO: 314:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:314: 

AATTCGGCAC GAGAGACCGG GTCGTTGACC AACGGACGCT TGGGCGCGGG CCCCTTGCGT 60 

GGCATCAGCC CTTCTCCTTC TTAGCGCCGT AACGGCTGCG TGCCTGTTTG CGGTTCTTGA 120 

CACCCTGCGT ATCCAGCGAA CCGCGGATGA TCTTGTAGCG CACACCAGGC AGGTCCTTCA 180 

CCCGGCCGCC GCGCACCAGC ACCATCGAGT GCTCCTGCAG GTTGTGGCCC TCGCCGGGAA 240 

TGTACGCCGT GACCTCGAAC TGACTCGTCA CTTCACSCSG SGAAGCTTCC GAAGGGGGGA 300

GTTCGGCTTC TTCGGAGTGG TGGCTCGTGC CG 332 

(2) INFORMATION FOR SEQ ID NO: 315:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 962 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315:

zl AATTCGGCAC RAGTCGGTCT AGACGGATTC AATGCTCCCG CGAGCACCTC GCCACTGCAC 60 

y ACCCTGCAGC AAAATGTGCT CAATGTGGTG AACGAGCCCT TCCAGACGCT CACCGGCCGC 120 

^ y CCGCTGATCG GCAACGGCGC CAACGGGACT CCTGGAACCG GGGCTGACGC GGGGCCGGCG 180 

GGTGGCTGTT CGGCAACGGC GGCAACGGCG GGTCCGGGGC GAACGGAACC AACGGCGGGG 240 

ACGTGGGGAC GCGCCCGGCG GGATTTCTTC GCACCGGSGC ACCGGCGGGG CCGGCGGCGT 300 

CGCACAACGG CACCGGCGGG GACGCNGCGC CCGTNGGGCG GCTTCTKGAT GGGCTCCGGC 360 

GGTNACGCGG CACGGCGGCG CCCGGCTCAC CGCCNGTTGG GACGCGGGGA CGCGTNACCC 420 

CGATCTTCTT CCGCNCCCCG GAAACCGCGG GGCCGGCCCC ACATTAKACC CGGCGGNACC 480 

GCGGMCCCGG CGGAACGGNG GGYNTTTTCC AACGGCGGGG CCGCGGAACC GNMGGS TGTT 540 

CCTTNGGSGA AGGNCCAAKT CCCGKCTANC YYAATCCCCG ANGGKTGAMC CTSATGSNCA 600 

MYTTMAGGAA CYTNCCCANT KTTSGRACCW CRC CNGGAAA ASRAWNKNGT KGGCAAACNA 660 

NNTNCYTTKN NATTKGGNNA AAAANCCCTY CCWCSGRACT NCCCCCCNGM GRGMCNNTNN 72 0. 

NTTTYGNCNN CCCGGSNAAM RNTTKATTTC NGGGGGNTCN GGGTKMNNNA AACCCCAAAM 78 0 

MNRNNKCSCA ANGGGKSNGC NKNNMMNSGT TTTYCKNMRA MRNWTYKNKN NTCNGARSRN 840 

NAAMCNNSNK NGKKKNNKAA ARNNTTWKTN KNSCNNNCNN GRRNGVRGGC CKMKGSNMNG 900 

MCWHNAWRNG NNGSNCNCKC NNKMNAAAAA AASGGVNCKS NSMKNKKKKG NRGGGGGGGG 960

GG 962 

(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:316: 

AATTCGGCAC RAGAAGACGC CCGAANGTTT GCGCTGGCTC TACAACTTCA TCAARGCGCA 60

GGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 120 

GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGTT 180 

GCAGAAGATG TCGTTCGAGG TGGCCTGGAG GATTTTGCAN GCGACGCCNG TGACCGCGAC 240 

GGGTTTKGTG TCCGCACTGC TGCTCACCAC CCGCGGCACC GCGTTGACCT CGACCAGCTG 300 

CACCACTCGT GCCGCTCGTG CCG 323 

(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:317:



AATTCGCAGT GTGTGTGGCG GCGTCCAGAA GAAGATGATC GCGAACATCG CCAGCGCCGG 60 



CCAGGCTATG GTGCCGGTGA TGGCCGACCA GCCGATCATC ACCGGCATAC AGCCGGCCGC 120

CCCACCCCAC ACCACGTTCT GTGACGTGCG TCGCTTGAGC CAAAGCGTGT AGACRAACAC 180

ATAAAACGCG ACGGTGACCA GGGCCAGCAC CCCCGCCAGC AGGTTCGTGG CGCACCATAG 240

CCAGAAGAAC GAGATCACCG TCNACGTCAC CCGAGTGCCA ACGCGTTTCG GGTCGGCACC 300

GCTTCCCGCG CCAAGGGCCG GCGCGCGGTT CGCTTCATCA CCTTGTCGAT ATCGGCGTCG 360

GCNACCAGTT GAGCGTGTTG GCGCCGGCGG CSGCCATCAT CCCGCCGACN ANCGTGTTGA 420

GCATGANCAG CGGATGAATG GCGCCGCGGC TCGTGCCGCT CGTGCCGAAT TCAACTCCGT 480

CNACAACTTG CGGNCGCACT CGAACCCGGG TGAATGAWTG AATTTAAACC GSTSAACANT 540

AACTACATAA CCCTTGGGGG CTCTTAACCG GTYYTGAANG GGTTTTTTGC TTAAAGGAAG 600 

AACYATTTCC GGATANCTGG CSTTNWTARC GAAAAGGCCC CRCCCATNGC CCTCCACAGT 660 

TTSCCCCTGA ATGGSAATGG MNCNCCYKNR CNGGGNCTTT AACRCSGGCG GGNTTTTGKT 720 

MCCCNNCTKA CNTTMMMTGC ARNNCNGGCC SKCCCTTCCK TNTYCCCTCC NTCCCCCNST 780 

TNCNGKTCCC CNNAMNYTNW ACGGGGGGCC YTNGGGKCRM TWTKKTTTGG GCCCCMCCCC 840 

MAAANASAAN GGGGKRNGTY CSTTTGGCNC CCCAMAARGG NYCCCCCCAM YTNRRKMCSY 900 

CNNTNKGGNN CTGTNCKNCG GAARAMAMCC KCCCCGNSTS STTNGTYWAG GNRWKGNSRG 960 

CCSCCCCGGY MNNNAAYAWN WMNATNCNNS STNANMAKKN NNNNNNNSCN WNGNGNNTCN 1020 

SCNSNGGKBC CSCC 1034 

(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:318: 
AATTCGGCAC GAGCCCACAT CCGGGGCCGC TCGTTGCATG ACTCGTTCGT CATCGTCGAC 60

RAGGCACAGT CGCTGGAGCG CAATGTGTTG CTGACCGTGC TGTCCCGGTT GGGGACCGGT 120

TCCCGGGTGG TGTTGACCCA CGACATCGCC CAGCGCGACA ACCTGCGGGT CGGCCGCCAC 180

GACGGGTCGC CGCGGTGATC GAGAAGCTCA AAGGTCATCC GTTGTTCGCC CACATCACCT 240

TGCTGCGCAG TGAGCGCTCG CCGATCGCCG CGCTGGTCAC GAGATGCTCG ANGAGATCAC 300

CGGGCCGCGC TGAGTGCGCC TCCCGCGAGC A 331

(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

AATTCGGCAC GAGATCGTCA CCCTGGCGAC CAGTGCACCC AGGCCACGCC ACCAGTTACG 60

GCTGATGGGC CAGAAGATGG ACCAGGTGCT GCCCATCCCG CCCACCGCAC TGCAGCTGAG 120

CACCGGGATC GCGGTCCTCA GCTACGGCGA TRAGCTGGTG TTCGGCATCA CCGCTGACTA 180

TGACGCCGCG TCCGAAATGC AGCAGCTGGT CAACGGTATC GAACTGGGTG TGGCGCGTCT 240

GGTGGCGCTC ANCGACAATT CCGTGCTGCT GTTTACAAGG ATCGGCSTAA GCGTTCATCC 300

CGCGCACTCC CCANCGCCGC GCGGCSGGGG CGGCCCTCTG TGCCGACCGC CCGAGCGCGT 360

CACTGACGCC ATCTCCGTCG GCGTTAACCC CGTGAGAAGG TGGGTCGTGC GCAAGTTGGG 420

CCCGGTCACC ATCNATCCGC GCCGCCATGA CGCNGTGCTG TTCCACACCA CNTSNGACNC 480

CCCCCAGGAA CTGGTCCGGC AMTNCAGGAA NTYCGTGTGG GCACCNGCTT CTTCCGKTRT 540

GGCYTAAACT TCCNATSTTN CSGCSGGCCT CTGGCGTTNC GNCCGGGCCG NTCTTNCCAA 600

ATCGGSMMAA ATCCCCANMC AAACCCCCCG GGTCTTGSGG GCSGGGNGGC GGCCNAWNCC 660

AAACCCCCCC NTTAAANTCT TTGKTNCCNN CNCSGGCNCC NCNAANSCAN CCCTTTKGGC 720

NCTTCCCCCC CCCAWTTTAA CCGAKCGSCN AAYCCCAAGY TMMGKCCYCY KNAAAAAAAA 780

AATTTGSCSG CCCCAANTAA ATTCCCNGGC CCYTTGGGGG CGRANCNYNT TTTMCCSNSS 840

TKGNNNAAMC NGGANCCSGG KAAYTMMTKG NAAYCGCCSN AAMBNTTTTC TAANNCCCCN 900

YNCCCSGAAA ATTNNAMAAM CMNNKTGSNG GGGGKTTSNC SGKKGRAGGM AAAAAANRSN 960

SKTTNMCNNN SANMNCNSNN SGGNSNNNNN NNNCNCGYKC CSNAANMCCC CGCGGGGGGG 1020

CCMMCC 1026

(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

AATTCGGCAC GAGAAGACGC CCGARNGTST GCGCTGGCTC TACAACTTCA TCAARGCGCA 60

NGGGGAACGC AACTTCGGCA AGATCTACGT TCGCTTCCCC GAAGCGGTCT CGATGCGCCA 120

GTACCTCGGC GCACCGCACG GCGAGCTGAC CCAGGATCCG GCCGCGAAAC GGCTTGCGTT 180

GCAGAAGATG TCGTTCGAGG TGGCCTGGAN GATTTTGCAN GCGACGCCNG TNACCGCGAC 240

GGGTTTKGTG TCCGCACTGC TGCTCACCAC CCGCSGCACC GCGTTGACGC TCGACCAGCT 300

GCACCACTCG TGCCGCTCGT GCCG 324

(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321: 

AATTCGGCAC GANGCGTGCC GCTNAACACC AGCCCGCGGC TGCCAGATAT CCCGGACTCG 60

GTAGTGCCGC CGGTGGCGTC GTTGCTCTCC TGACGGGGCG CGGCGACCAT AAGGTCGCTM 120

ATGCCCAGGT AGCGGCCCAG GTGCATGGAG TCGATGATGA TGCGACTCTC CAGCTCGCCG 180

ACCGGGAGCT TGGCATCGGG CCTGATCAGC CAGGACGCGT AGGACAAGTC GATCGAATGC 240

ATAGTGGCCT CCAGAGTGGC CGTGCAMTTC CNGCGTGCTC CACGGCAAAT GCCTTGATTT 300

CTACTCCGCG TANTGTTCCC GCATCGCCTG CGGGATGAAT GGGAACCGCA SGATGGCGAC 360

GAACGGGTCT GANCTCAGGT TTGCCGCTTT GCGCACAGTG GTCNACANCC GGTACTCGGC 420

ATANATCTGG CCCNAAATCG GCGCCGACGG CGCCCACNAT AANAACGGGC ACNACAATCG 480

CCGCCCCGGT CACCCNAACA ACANCTTGSC ATCGGATTTT GTCCCCANCG CTCAANCCGT 540

CCCGAACGCC TCNTCCGGCG NACTTTTCTT NNAWTAACTG CCGCTTCCGK CCCTGGNGCA 600

WTAAATGGGA AACCCTTNCC CCACCTTGAA GGGGTTGTTG NATTTTTACT GSTAACCCCG 660

AATTNTTCCG GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 720

AGSTTTTCTT SYTGAAGGGG GAAACCCAAC TTTNTYTYYN AACCSCMNAA MYMTTTYCSG 780

MNAASCCNKT CCCCTTTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 840

NCCCCYMANG GGGGGRAAAA TSTKTCNNCG GGGCCKAAAW ACCMMMMYGN GTGKKKNKSS 900

GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GSTGKCCCNN 960

NTTTCCNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 1010

(2) INFORMATION FOR SEQ ID NO: 322:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322: 

AATTCGGCAC GANGCGTGCC GCTNAACACC AGCCCGCGGC TGCCAGATAT CCCGGACTCG 60 

GTAGTGCCGC CGGTGGCGTC GTTGCTCTCC TGACGGGGCG CGGCGACCAT AAGGTCGCTM 120

ATGCCCAGGT AGCGGCCCAG GTGCATGGAG TCGATGATGA TGCGACTCTC CAGCTCGCCG 180

ACCGGGAGCT TGGCATCGGG CCTGATCAGC CAGGACGCGT AGGACAAGTC GATCGAATGC 240

ATAGTGGCCT CCAGAGTGGC CGTGCAMTTC CNGCGTGCTC CACGGCAAAT GCCTTGATTT 300

CTACTCCGCG TANTGTTCCC GCATCGCCTG CGGGATGAAT GGGAACCGCA SGATGGCGAC 360

GAACGGGTCT GANCTCAGGT TTGCCGCTTT GCGCACAGTG GTCNACANCC GGTACTCGGC 420 

ATANATCTGG CCCNAAATCG GCGCCGACGG CGCCCACNAT AANAACGGGC ACNACAATCG 480 

CCGCCCCGGT CACCCNAACA ACANCTTGSC ATCGGATTTT GTCCCCANCG CTCAANCCGT 540 

CCCGAACGCC TCNTCCGGCG NACTTTTCTT NNAWTAACTG CCGCTTCCGK CCCTGGNGCA 600 

WTAAATGGGA AACCCTTNCC CCACCTTGAA GGGGTTGTTG NATTTTTACT GSTAACCCCG 660 
AATTNTTCCG GANTCGGTCN KCCGGGSTTT YSTNTTCCCC ACCTTNGNAN GGGCCGGCCA 720

AGSTTTTCTT SYTGAAGGGG GAAACCCAAC TTTNTYTYYN AACCSCMNAA MYMTTTYCSG 780

MNAASCCNKT CCCCTTTAAC CAMGGSGGTN AACCGKTMNG NGGKTAAAAA GGGSKNNKTG 840

NCCCCYMANG GGGGGRAAAA TSTKTCNNCG GGGCCKAAAW ACCMMMMYGN GTGKKKNKSS 900

GCSAAATTTT NMMRAACTKN GGGGCCSSGA NNTTTNAAAG MSCCCCCSNN GSTGKCCCNN 960

NTTTCCNNAA WMKKGKNWNM SNMNSCSNGG GKYNSGGSNN NNAAGMGGGG 1010

(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1092 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:323: 
NGNGGGGWNS NTCAYCAYCA YCACSGGGYW CWATTGCGGC CGCAWCTTGT MAASAGATCT 60

CGAAYTCGGC AMGAGGGAMT CKCTMGCNCC GCTGTGCAAN CCAATRAGGC CTRATAATTY 120

CCACTCCACA AAAAACCGTT GTGTGTAYYT SCCGRAAATR AAGGCGCCGG TNTCAACWYC 180

GCCGGTKTTY CCRATYCCCG TKTTGTAMCT GCCKGGGTSR AAAYCCCCGG TGTTGGAYCC 240

CCGGATTGAA ACTGCCGGKT TGAAACTGCC GKTTTSGCSA TCCGGKWATT GAMSTCRCGG 300

ATTAAAAAAC CGGKKTTGGN GCTGSNCGTG CCAAATNCGR AYCCRATAYC CCATGGCCTG 360

KYCTYCTCCK YCGGTACCCA AAYCTGGGTA TCCTATACTG GYCCCTAAAK GCAAWYCKGG 420

GCTGYCMMTK TTGCKGGSGT CCNAATTTAS CACCASCGGT TCCTTCCATA CCNAAACNCG 480

CKTGGGCWCC AGMCCGRAAA AAAKAATAAT RAKAAKGGTG CATNYCCAAA ACCNCCGCCN 540

CCCNANTNCN ATCCGNTNCC MSCNCCCCCA GCGGTNAAGK TKSGGAAYTT CTMMAACCCC 600

CAAANCCCCA TAACNTNCGR GAASAAACCC CTYCNCGGGG GYCNWNCAAA ACASCNTTAT 660

TTGCTKSTTT CGGGMWCCGT GCCGCCNAAA YCCCAAASTA CTTTYTGGGT CCNAGAKAAA 720

ACCNCGGGCN CCMCCCSNAA NWTATYTCTT KGGCAANCCC CSAAACCTTR TCMNACCNCK 780

ATRMTCCCTT CCCCVSCAAT TGGYCGGRAT NCGSNCCYTY TCAAAKKKSC CAKWWNNGNG 840

GRRNNACCMA ACCCCAAGTY CCMNAAAATN GKCCCCGCTC CNAACACGNK TYYTCCSAAA 900

ASCCCWCCCC CCCCCCCRAA AACCCCCCNA RKANTNCCCA AAAACNYNGK GGCCCCCCCC 960

CAAACMAAAA AMCCCCCSGM RMACSGGGGN NMCCCCGKKK KKTTTTCTTT TKCCMRSCCC 1020

AAMGCAMWSY KSKTNMAAAA GGAAGRANCN TYCCSANANM TCCCNYWRSW CCGSWGMGNA 1080

GAASMCCCCC CS 1092



(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

GGGGGGGNNN NATACATCWT CYGTGYACCG GGGMTCTAKT GGCGGGCCGC AATCTNGTCA 60

ASAGATCTCT NAMTTCGGGC ACAAAAACTW GACAAASYMT CGNGCNMTCC GTGTCCTNKA 120

TCGCAAAACG NGTRACASAC ASACACRTAT GTGTGCCCAC CASCAAYTCK TTGGGACCTC 180

GCTRACCGGY TGCCCRNACG CCACGYTGCS CWTCTATCCC RACGCCGGCC ACGGGYGGGG 240

ATATTCCAGG CACCACGCCC AGTTTGGTGG ACAATGCCCT GGCAKTTTCC TCRAANTTCG 300

TGAAACCGAA TTCNSMTTGA ACCNCCAARG CCCCSNCCNR AACARTTGGG WTCCGCGGTT 360 

CTCCCCACCG KTTTCCGGGG GTNTCGGCAN AANCGCACCC WTGGWTTCTM TCNCCGCACC 420 

GGGCGGACAA NTCGGGTTGC AATTTTGCRA AYCGGGGCCG GGATTCCSCA AACGGGTGCC 480 

GAAACTGTTY YCRAAMACCG GGAKCCGCAA TTTCCGGGCR ANAAATTTCN YCNCACCACT 540

GCTTRTACTT CCCCGACCGT AACMANTTTC ATCGTCNTNN CCTCTGCCCT TGGGGCAGGG 600

CKAAAYACCG CMTTKGGTTT CGCAACCTGC GGCCCAANTC CCNAMCCRCA CTTTCNATTT 660 

GGNTCGAATT SCCCCCCGGT RANAACCSCC NTGGCCNNYT CGGASSAAAA NGGGCCCTNT 720 

KGGCNSCCCC AGTAANACCC TACCNNAYTS CAWTCTTTGC CAAASTTKGG ACGAANSKTG 780

GGNTTCCGGK ATTTYYTTGS GGNCNCCCTN TATNGGSNTN GGGCCKCYNC NCSTKTGKCA 840

NASSKAYCCS NGNKGGGGGT ACCCCCCTMG GGGGGTTTTT NSSGCCCCCC AWAYGNKSTG 900 

GCCCCCNNGG GGAAKAATWT MWWTMCNSGG GGGAAWTTTT NTSTGGAMCS SGGACYCCCR 960 

GGGGGKTTTT TCCCCCNCSA NNAWANGGGG GGGGGANAYT NTGNSGNGGG KWNTTTATTT 1020

YTYYCYCCTM TKACMSGGGG GTTTKKAKNG GGGGGAGAAA ANAAAAAAAA RAKGGYKNTT 1080 

TSKNCACNCT GKWNWNWANR NAGAGKTCCT CKCKCCNCSG SNTTTCTTTT MGNSGSYGGG 1140

GNNGNNNAAA ACNKSRMMAC KCSYTYCCCG CGYCTCCTCC NCNGGGGYGS NGSCGNSTYN 1200

GNNKGRKWTA TNTMGNCGTN SCCTCCNCCC GCKNKNTGTC TMTCNMYGSG C 1251 

(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1099 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:

AAYTCGGCAC MGAGTATCAC CAAKCTGYGT GGCCCAGCAA AGTGGAGCTA TTACTACCTG 60

TATGTGATCC TCRACATCTY CTCCCGCTAC KTGGTCGGGT GGATGGTGGC CTCGCKTGAK 120

TCRAAGGTCT TGGCCRAACG GCTGATCGCG CAAACCCTTG CGCCCAGCAC ATCAKCGCCG 180

AACAGCTGAC CTGCMCGCCG ACCGGGGGYC GNCAATAACT CCAAACCGGT GGCMCTGCTG 240

CTGGCCNACY CCGTGTCCCA ANTCGAACTC ASCCSGCNMA CCAKMAACKA NAACCGTTGT 300 

CTGAAGCCCA GTTCAAAAAC CTCAAGTWCC GGCCCRACTT CCCGAAACGG TNCGAGTCKA 360 

TCRSAGGSGG CCGGGTGCMC TGCAACCGGT TCTTCGGNTG GTRCAMCCCN AAAMCAAGCA 420 

TTCCGGGMTC CGMMTGCCCA CGCCGCCAAS TTTMCTACGG GCSGSCCNAT CAAATTCGCC 480

GGGAACSGSN CCMCCKTCNK GGAMACGCCC TWCCAAAACC CYCGAACGGK ATCCTTCKGY 540 

NAACNCCCGA RCNCCCKSKT TCCGGGCTTC NMSGCGAATA CCCKNSCMNT CCGAATCCAA 600

TTCCCMKYGG CTTTTYYYCC CCCCGGCCCC AAAYNGGGYC CCTASSNMKC KNCCAMNANT 660 

CCNWATCTGG NGGTCCCNAN KYYGGCGTTC NMAATSAMNA NMNRGGGTYT TSCYACCMMN 720 

AACCGKNNKG KCCCCMKCTK MANAAAKATT RATCAMKWNG GGNKCKCNCN NAAMACCSCN 780 

CNCYNCWYTC TMYCSSKWGC GCSMYNANCA SNGGGGAGGW GGSGRMKMCT CTMTCTCNCT 840

MGCGCCKNTN TYCKSGAKAT ACASMNKTCC GCGCNGCGCN MAAMANRAKA CTAKCCGYGN 900 

CCSNSTMTYN CTSNNMKMNN TCCWMWNATC NTYYGKKCNN KCTMKATNWC CSCTSKCNCK 960 

MRAMTCKTYG SNMTCCTCCA TCNCTCKKSC SNMSKNTCKC KSCNCCNCWN CNKCNMKCWN 1020

GGNSTCRCCY TCTMNNNTCS AGCKCGSKNC WACNCACACK NGWCTYTTCC WKNNMKCNKM 1080 

TCKCKCACRG MTMTCWCCS 1099 

(2) INFORMATION FOR SEQ ID NO:326: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 296 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:326: 

GNGNTATACA TCWCTGTGYA CCSAGGATCW ANTGCGGCCG MAAKCTWSTM CASAGATCTC 60

AAAYTCTGCA MGAGCGGCAC AKAKYSTCGT CCMRACCCGG CAYACWCCWG CNCGCCCCWT 120

CTTRGACCGG GGCKATASMC ACCGTTGGCC CCGGCNCGCA CCTACACCAC CCACGCCGCC 180

AGCGCCCCCW TRAMCAAACC ACCCCGCKTT TACCGCCCGC GCCGCCGGGG CCACCACCAG 240

CCCCACCGGC ACCACCGGCG CCGCCGTTGC CAAAACAGGC CCGCKTTTGC CACCRA 296 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:

NGNGSGNKMY ATCATCWTTC TGCACCSNGG MTCWATTGCG GCCGCAATCT TSTMNASAGA 60

TCTCGAAYTC GGCAMGARCA TCTGCGCGGN GAATGTCCAA AWGTCWKTAA CGGCMATCGG 120

TTTGCCGYCA ACCACKCTRT SCAKATGCGG GCCAMWTYCA AACCRATTAT TTGGGYCGAG 180

AAAATTTMCG CKTGTRASCA ACCTGCAGCG GGTCAASCAA CAGCCTCTRA ACCGTAAATY 240

CKTAGGTNKT YCCGGCAACA ASCYCRATAA TSCGGCCCGC AMCCACAAAA CCTGANTNGT 300

TNTTCNCRAA NCCGGTYCCC GRAGGGGTSA ACTGCSGTAR GCTTNTCWYC NCCTTRACAT 360

TAAACCCCCC CGGNTCWTCG CCGCGCCCAA ATYCYTGCCC WTKGCNACCA YCCCANCCTG 420

CSGTATGGTS RAANCASTSG GCRAACGGTM MCCSTACCKC TGGCTGATYC KTCGGNTCCS 480 

SNAATTCGGG GATTTACGGS CAMGGTTAAY CCAGGYCCCC TNTGCYTCKY CNACAACCSG 540 

ATCMWCNCCG TACCTKTTAA AATTCTTTGT GGTGGAACCC AWYCKAAAAA NMTNTYCCCN 600 

TCCAMMGGGG CYCGGAAKKT CNACNTGGKT NACCCCTNCC YTTGAASTTT TCYTGNCCCC 660 

GGCCCKAAAS ANACCSGAKC CCCGGAAYCS WTAGGCYTCN TGCCCCSTTA AATTKGNCYC 720 

AATCCKCCAA CGCTCCCCGG GGTCSSCCMT TAAAMTTCCC CCCKSCASNG GAATYCYKSG 780

GCWGTMATTW CCNCCCNTTT CYYGKNAAAC SCCCCCWKGN GSCTYCCCCN SNTTSSGCCS 840

GGTTSGAMYC AAAAWTNGGG MMCNRAGNCG SGNAMCCSCN GKKGGGSATW TKAAYYCYGG 900 

GGGGGTCNYC CCCCRCSNAA AAGYGTKGGC KCCSSSCCYC CCMARTTTYT CNGGMRCMAM 960 

ACCANGGGNG CTCCCGTNCW WGGCTCCCSN SNSMAMAAAN NKCKCCKGGS CKGARRNMNA 1020

MCTCSNGNGG WTCCCKNKTC NSCNSGNCGS YGGNSASWCC YNYCNCCACA ANC 1073 

(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:328: 



CGCCCCGTTC TTMMMTTCAY TCATTCACCG GGMTCTAGTG CGGCCGCAAK CTTGTCKACA 60

GATCTCGAAY TCGGCAMGAS ACAATSTCGG GTKGGGCAAT GTCNGGTGGG GCAACTTTGG 120 

GCTCGGRAAT YCGGGGTTAA CGCCGGGTCT RATGGGTSTG GGTAATATCG GGTTTGGTAA 180 

TGCCGGCAGC TACAATTTCG GTTTGGCAAA ATATGGGTGT GGGCAATATN GGGTYCGCTA 240

ACACCGSCAS TGGRAATTYC GGTATTSGGT NACCGGTRAY AAYCTGACCG GGTNCGGTGG 300

TTYCAATACC GGTAACGGGA ATGTSGGTTS YYYACYCCGS GSAACGGNWW YTTNGKTCCT 360

TMMCNCTSSM CCKSAAMTSM KMGGTSTYCT MTYCNNGGAS TAMTYNMCCC CCGWAYCKSC 420

WAYCCCTCGT CATYCCMCMC SGSGYCCTCA MNCCACCYTG NGYYCCCTCC MKMTCYCAYT 480

CMNTCCGGTW CCTNTMMNCC CSCNCRYCTC AMCNCTKSGK CACCNATMYC CSACKCHTCT 540 

MCYMCSCAKN MTTCCCCTCN CCTYTNNCCA MCMCSCTCTM TCMAACTCKC CCGGYCKCNC 600 

MYCTCTCKCC AYNMAACCKK TYCYWCNWYC YMYCKCKCAG WYKNMCTCCW ACTCTMYNTT 660 

TCTCTCNKCC CMKACCKNTT CTCWCSCCCC CCACAKAYMC YAWCMTMTCC MCTCKACSCC 720

CYYCNNYCCM NMCWCMTCWC TWNAKCANCN TTCTTCTCTC MMYMTMACKC WCNNTCNCCK 780 

SGACCYTCTC ACTKMKCCKM TCTCCTTMCK CCYMWCNTCC MKYNCCCTCC NMTCMTCKYT 840 

CCTCNCNMPV CYYYAKCAKC NMCTCCCCAN KMCAKCTKCT CCCCCAKMKS ACNCKCCCWC 900 

CCTCCTATCC WCTCTCWCTY ATCTCKCTCW CNYCMYMKMC ACNCKCYAYT CNACTMNMWN 960 

CCANCNCTCT CTNYCTCWCK ACGTYCKCCK CTMCKCNYMC NRWCTYRCCT CKKCCNCCRN 1020 

CKNMCMKCTM CTCTCCWMKM TCCCWCCCAT CTMMKSTCTC WCNCMTCCCT CNKCCYNYNT 1080

KCYTYCCMYG CTTCKNTCMT MCCWCCYATC TCTMKCCTGT CWCACYMCAC WMTTACWNCC 1140

ACTCTCTRCW CKCCKCMCCR MTCTCB 1166

(2) INFORMATION FOR SEQ ID NO: 329:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1230 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:



NGNGGNNNNT CWTACATCWN TCTNCACCSG NGMTCWATTG CGCGCCGCAW NCTTGTMNAS 60 

AGAATCTCNN AAYTCGGCAC ANATGTCTTT TSTMTAKTGT GGCGGGGNGC CACGCCKTAT 120 

GTGYGCCTGG GYTRACCCAA CCCCGCGGCS CGGGCCRACC AGGCGGGGRA TSCAGGCCGC 180 

GGCGGCCGCG GCGGYTATAT RAAGCGCCGY TTTTKTRATA ACGGTSCCGC CGCCGGGTRA 240 

TTACGGGCAA AAYCGGKKTT TTGGGTRTAT AACGCTAATT GCAACCAWTT TTTYCGGGTC 300 

AAAAACYCGG CGWGCANATC NCGGGYCNCT RAGGCGCATT YMCGCCAAAA WTNTGGGCGC 360 

AAAACCCCKT TSYTATTTTN TGGGCTATSC GGYTGCTTCG GCAAACGCTY CCCGGGTTAA 420 

TCCCKTCCGC GGCGCCGCCN AAAAACCACC AATYCCGYTG GGGGTGKYCC CMCAGGCSGT 480 

TGCTYCGNGY CACCTGGCCA AAYYCCCAWT AKATTGGGTG SCYCKTSCGG TTSYTGGGCY 540 

CAATTACCCC CNCGGGNAAA GRRAAAANAA ATCNTCCNTT TGCTCGGYCA YCTTTMTTGG 600 

SAAAAGGGGC ATGGCSCGGT TYYTTTACCT CAAYCCCCNA NCANTWACCT YTCCSCCCGG 660

GGGGNCANAA CGSTTNGCTC CGSGGNAKCC TKGTMCCCGN ATCNAAAGGC CNGAATTTGG 720

TYYSSTYCNA ATTWTWKKKY CCCCWCNTTG YAAAAAKCCA AAASAKCCCK YCNCAMMYKT 780 

NGGGGTYSSG GCCKNYCTTK SNMTTAAACC CYCCCCAAAA YYNSGGGKKT TCCGCYNSAT 840

KCCACCNCCK GNGGGGGGNA SAAAAAAAAY TTTYCCSAAA ATCCCACCYY TCYKTKSTRY 900 

AMACCCCCTT TYYMKKAYTC CKYSCNATTC SGMTTCWAAA TYCCGYGGCT TNTTCCCCCK 960

CSGGNGCCCC AAWTTTGKTT YNCNANTTYC CCCNAAMNCM AWTMGGGGKS KCCATTCTGG 1020

SCYTMAANTA AAANAANGGG NKTTTYYCTY MANAAACACN GTGKCNCNCN CNAAMAAASN 1080 

AKMAAAKAGN KKKMTKNNSA AANCCNCCCC CTSTYTNYTT NKTNMNCKCC CYGGKKNKGM 1140 

SWSWYNTTCT NCCCRCCCCC YNYNKTGANA AAMMNCYCCS GGSTMCRNAN ASNMNTTTCK 1200 

STSTNGMGCC KMBASNANAN MCAMWKWYCC 1230

(2) INFORMATION FOR SEQ ID NO: 330:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 



NGNGGGKNNA TMAYCWTCTC ACSSGGTCTA TGCGGCGCAW CTMGTMAASA GATCTCNAAY 60

TCGGCAMNAN GCATMTCMMC CATATATAAC CATTGCGTCS GYWTGCAWCT CRAAWCTGTC 120

CTTCSKGCCG TTKTACRAAG GTGGMWTGYT CWTYCGTRAA SGCGTGRATC TCKTKTATYC 180

CTKGGGCTYC ACTTTAACSG RATKSCTGCC TTKTAYCATT RATGCAAWTA WTGGYCRAWT 240 

KTTGCAGGCC RACGGCWYCT TTTYCCGCRA GRACAATNGA TTGGAWYCGC TYCGCRAGGC 300 

CCGGCACCAR ACCGGGCNCC AAAGGYCCGC GCAAWTSCCT GGKTCAAAAA TGGTGCAAAC 360 

AAAMCNATCC CCGGYTTRAC CGCAGYTAMC ACAAKAAAAT TCCCWTGGCC GCACCAWNNT 420 

TTYCRATCWY CWYCCCCACC TTRAACTTGK YTGCSGTATT GCCTKCCTGC CTCRACAGCM 480 

YCNCCCKTCA AACCTGCGGT GACTCCAACT GGTCTGGYCG AASGGGGGYT CAMCGGACAA 540 

AACCCCRANN TCGCCAAATT TTCNCCCCCC CYCGGGAAAN GKTGATMTTC TCSNAACCSA 600 

CMGGGNNYTW NAACCCTGAA CSSSGSNKGA MYNSCCSGGA ANTTTTCCCT TYNGGGCGRN 660 

AAANCCTTTT AAGGTACCCC KGGNGGGGKG CCCYYTTGGG AAAACAACCC CKATTGGKTT 720

TGGAAATNTT TKCNCCCCCA TTCNSGGGGG GGGCCCCAMC CCMMCTTTTN TCMSCNMTYY 780

YCYYGGGAAT TNYTCGCCSG GAAYYCGGSM CCKGYCCTAA NCCCCMNWGG GKYSTGSNAR 840

GGRATMAWWT TYSTTTYYMC CCGGCNNCCC CCCKAKMCNT KGNTGAACMA AAAKCSGGGG 900 

GSCNMYMWYY YCNNNGNRTT TNRGGSSNMT TYMAAAMMAN GGGGKYWTYY CKCCNGSCNN 960 

GKTYSGGGST TTTCCNTTTS GGGSSATYKG MACCCCKTMT AYCCGGGGGT NTKTKYCCCC 1020

SC 1022 

(2) INFORMATION FOR SEQ ID NO: 331: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

NNCGNNKNTA TAMAYCWYCT NCACCSGGGA TCWATTGCGG CCGCAATCTT STMAASAGAT 60

CTCKAAYTCG GCAMGANCCG CAWCTATTTG KGTGRASCGC ACCAGCGRGA CCTCGCSGKT 120

CKTTYCTTGC AGRGAGGCCK TGGGTGGCRC CGGTGGCAAT GCCAACCGCC CCCCAAAACN 180

CCGCAAATMY CRAAAAACAA CCCSGGGGTA GKTCCSGGCC GCCAAATMAA TAACCGTKTT 240

AACKCAGGCN ACGGCCAACC GGYCCCGCCC AACCAAGCNA CCTCCCCSCC NATAGGYCCG 300

GTGGGGGCTG CCKTATYKCC AASTCGTCAY CTCNACGGGM CGGYCCMCWT TCCGCCTCAT 360 

CCGTCTCTCC TTMMATTTTC CRTCCACYKG GCGGGGAACY TTTTTNYCNC CCTTGSCMAN 420 

CACCNAAGGY CNAAAATTNC CCMTGCCKYG SNNCAAAYGR GATTGGGGTY CGKKTTTTNT 480 

TCNMCCMAAC CCCCNTTTNA CGCCCCMATC CCYTWATACC CCCWWMCMNS ANGKTTGNSA 540

AAKTNNCCCC AAATRCCAAA MTTCTTCGCC NTTTMTWMCY YYCCTTTCCC CMCCCWNAAA 600

GGSCCRCCYY TCGGGAANTY TCCCCNCAAA AWTCAMWCCM TTTCCCNCCA AGAAWTTCSG 660

SACTCCTTTN TTCNGGGNAM ATANATYYTT YCKTNGGGSK TTCCGMTCNC AMMAATNTCC 720

RGGGKAAMCC AGKNTNNTCC YYYYCCCCAA NNTYCCYKGG RMCYNNYYCY TTAAANRASR 780 

SAACCCKSGG GKCYNCNCSS TARCCCCCAM KAAAATTTCC CCCSSKTTTC TYYNNKKMRW 840

GCCCCCSAAM ACTMTWAYTT TCCCKCGNNN TTTSYCCKCS KCAMWMWMTG KKNCTTTTTT 900 

YCSCMATAMA CTTNGGKCCT NTCNYGSGCG CMAAANAAGG CGCGSTTCTN TTCWMAMACA 960

YNTSGNMMMA SAAKAKWATA AWNNTRKKYK TKNNCCCNCC CKCKCTTSNN TNKCCMCSKS 1020 

GGGKNWNKKR GWCTCCWCNC CKCCCNCKNK CCKWATMCCC CCCCSKCCGM NCMMNTTTKT 1080 

CCC 1083 



(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1069 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332:

GGGGNNKYAT MCAYCWTCTS YACSGGGMNC TATTGCGGCC GCAWYTNGTM GASAGATCTC 60

GAAYTCGGCA MGAAAAAAGW GATGTGCTGG ACCTTMCCGC GCGGGACGCR ACCRACAAAG 120

RAASCGCGCC ANAATATTGG CCACAKTTGG TCACATATTT ACCCAATTMT AYCAGGGAYT 180

MCCATTCCKG GGACCRACCG CACAATCCCR ATSKTGGTTT GCRAACCCTR ACCGTCCCCA 240

MYTYCGCCRA STTGAACCAG GGCRAAAAAA CGGCCRAAWY CTCGCCCTGA NTCCCGCTCS 300

GCGCNAATAA CTAGGCCCAT TKAACGGAAC CGGNGGCCSC NANTTGGCCA ACAGGTCCTR 360

ACAAAGGGGC CCCASYYCGG CCGGWTCCCW TTYCACNCCC TNKTCTCKTG CCGAATYCGG 420

WTCCRATNYC CCWTGGGCCT TKTCKYCKYC KYCGGTNCCA AWTCTNGGTA TNCTATRGKG 480

TCCCCTAAAT SCANATCTGG GCKYCCATTT NCTGGSNTTC NATTTAMMAN SRRCGGTTCT 540

TTCWTTCCRA AACCGSNTGG GCCCNNMCCA AAAAATGATN ATAATAATGK YGSCTTTCAA 600

ACCCCGCCCC CCCATTCRWT CSGTTCCANC CCCCNGNGGT TAAGKTGGGA ATTTYTNAMC 660

YCNARGCCCT NATTTSGGNA AAAACCYCYC GGGYCTCAAA CMNYTTTTTT GSKSSNTCGG 720

GCTCRTTCSC CAAAACCCAA ATTNTYNYGG GGYCCKTNAA ACMCGGYCRC RCCGGAAATT 780 

TTTYTGGTTC AACCCCAACC TTTTCAASCC NTTTTYTYYT TRCCSSCSMN TNGSSGGGNT 840

KSSCCNTTCY RARKKCCNMN GGGGGWYCYN CCCCRMNTTT CTTTTTTTTT CCGTNNMAAM 900 

NGKTTCTTCA AASMCCCCCC SCCCCCNSAA ACCCCCTNAR GTTTTYCMMA AANNWYNNGN 960 

KNCCCCCCCC MMNAAAAAAY YCSCCCGNRN ACSMSNGGGA MCCCCCGGSN NTTRKTTTTT 1020

TNCMSGYCCC CSRMASYYTT TKAMAMANRR GAMNSMTTTY TNNRGNWNK 1069

(2) INFORMATION FOR SEQ ID NO: 333: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 



NGNGGGGKWK MATACATCWT TCTTCACGSG GGATCWATTG CGGGCCGCAW TCTNGTMCAA 60

SAGATCTCGA TYTCGGGCAM NACCCACCWC TCCRAAAAAA ACCCRAAWCT CGGGSKCTYC 120

GARAAGTGTT GCCCGCKTTR AATTTAACAA ATTCAGTGTC ANAGTGTCAC GGCKTTACWT 180

YCCCGGCAAA GGGGCCACAA CCTGCAGRGA SCACYCRATG GKTGYTGKTS CNCGGGCGGG 240

CCGGKTNAAG GGACCTGCCT GGGTKTGCSC TMCAAANATC WYCCGCGGGT YCGCTGGRAT 300

MCNCAGGGGT GTCAAAAAAC CGCAAACAGG CACSCCANCC NTTTACGGGS CTTAAAANGA 360

AAAAGGGCTG ATGCCCCCAA GGGGGCCCGC NCCCAACCTT CCGTTGGTCA ACAACCCGGT 420

CTCTCKTGCC RAATCCGRWT CCRATNYCNC CWTGGCCTTK TCKYCTYCTY CGGTACCCAA 480

ATCTGGGTAT CCTATASTGT CCCCTAAWTT CCAAATCTGG GCTGTCCATT TSCTTGGCNT 540

TCCAAATTTA CCANCAACGG TTTCTTNCAT NCCAAAAACC GNTKGGCKCC NRACCCRAAA 600

AAATGAATAA TAATAANNGG KCNNTTYCNA ACCNCCCCCC CCCNATTCCA TYSNGTTCCA 660

NMNCCCCCAG NGGKTAGGTK GGGAAANYYC TCMACCYYCA ANCCCTWARS TTTTNGRAAT 720

KAAACCCTYC YCNGGGTCWW TYMAAAAAMA NTTATTTGGN NGNTTTCGGG MWNCKRKNST 780

SCCAAAATCC MAAATANTTT YYTGGTYCNA TWAAAAAMCG YGNCCMNCCC GGAAAAWTTT 840

TTNTGKTTSA ACCCCAAAAC YTTTTCMNAA NCSSKTTTTY CYTTCCCCCC AMNWTGGGYS 900

GGGNATKGYG SCYTNTCTTA TKTKYTYMTW CMGGGGGGNN MKMTCMMCCC CCMTTTYYCY 960

NYWRTTTTTN KCCCCKTNMR NNRAANNGGN YTCSYNANAA AAGCNCCCCC SCCKNCCCNA 1020

AAAAWCCCCN NNNARAKTNT TTMKANNRMN SCKCNKNGKY YCCCCCCCWC YNMNNAAAAA 1080

AATMYCCNCC RASANMCASM NMGGRGNRSC CCCCCCCSTT NNNNTMTTNT TTTTTTCSRA 1140

GAGCKCCSCG MNNANMKNCK CTTTTTKCNC NNGNNGNGNN GGNGMNCKCC CCNAGAAMWK 1200

CTKSTCCCKS 1210



(2) INFORMATION FOR SEQ ID NO: 334: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1105 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:334:



NGSSSNGNNA TMCATCWYCT GYACSGGGMT CWATTGCGGC CGCAACTNGT MAASAGATCT 60

CGAAYTCGGC AAKANACACC ACCGCCGTGT MTATACACCG CAAATGTTCT GTKTGCCAAA 120

ACCGAGACGC GCCGGCCGCG GGGYTCCAAC GCKTTACYTR ACCCGCCAGY TCAGTGTTRA 180

AACCGGTGYT RAGGGCCGCA CCCAACWTAA ACGCTTTAKC CAAGRAWYTG GKTGGCCCGC 240

AGCCACCTGY TGTGGYTGCC CTCWYCGGTG GTAGCGCCGG TTANCGCCGG TTGCGCGYTC 300

AMCASCSCGC CGGTRATCCC AKCNWTCCCC CGGCCMRACC CACCGGGCAC TTTGRACGGT 360

GCCGCCAATT CAAAYCKYCT GRWTCCTTCM AAACACCACR AAGGCCACCM CCMSCACCNA 420

ATMGGGRACT TTAAGGCCCA GGCAAAACCT NTRAKCNCCT CCCGGGCRAA GGTCCSGCAA 480

SCRATCCMAA AAAAKCKNAT TTCCCCCAGC AKCAACCCAA MMCGSTTTGC TGCTTCCGGA 540

TTCGAAMCCA ATTMCWGGKT NCNWGGGAAA AACASCNNCC NWTAKCCMGG CCCMCGGGCA 600

ATTTCSGRAA SAACCCCTNY CCCGGGTTTT YCCTGCTCMG GCCCAANACC CCCGGGAATC 660

AAAAASGGTC GGNCAAANGG GCMAAACCCS SACCCMACTT WTTCCRCTTN GGGGGGSCWN 720

CCKNGTTTAA AWKSCCTCYY CTSCCCAAAY TCGGKCMAAA NNGRKTTGGK TTNGGCNACC 780

NTTTCCGGKC CCGGGKGKGK WGKYCTMNMA CSTTTNTTTT SCCCCYKAAA NYSCCCCCCC 840

CGGSSCCCCG CCCGGGGGGA NNTTTTTAMA GKKTYCCCCT CCCCAMAAAA ANACCCCNYC 900

CCSGGSCCCT TTKRWAAAMN KCTSCCCCNG GNNGGGGKCM GGKTTATTMT NNNCCSCCCC 960

TCCGCGSAAA AAATAKMTTT SYCCCCCCNC CTCCKNCKNR GKAMSMSCGC TCCCYCTCNC 1020

GCNKNTWAAN ARSNCCKKNN CCNCYKCCGS NSNGKCNWCD NCCSTSSNCT NKGCNCKNCN 1080

KAAANAAYNC NGSMSTSSMN CNKCC 1105



(2) INFORMATION FOR SEQ ID NO: 335:



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 936 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335: 

NGSNSNKNNN TAMAYCWYYC TSCACSNGGA ACWANTGCGG CCRMAWCTNS TMKASAGATC 60 

TMGAAYTCGG CAAGAGCGGC AAGAGTGTGT GCATCTGGTC ANAGTSTMMA CRCGGTGCCG 120

CSGGTGKGTR GASCACMCAT NTGCGRACAC CAAACCCKTC GCGGGYCACC GGCKTCGCCT 180

GCAAAWYCCT CCAGGCCACC TCRAACAAYW YCTYCTGCAA CGCARGCCGT TYCGCGGCCG 240 

RATCCTGGKT CASYYCGCCK TGCGGTGCCC AAGKTACTGG CSCAYCAAAA CCGCTCCGGG 300 

RAACRAACKT AAWTYTGCCG AATTTCNTTC CCCTGCGCCT TGATAAATTT NTNAAGCCAC 360 

CGCAAMCCTY CGGGCKTCTC CTCKTGCCRA ATYCGRWTCC RATAYCGCCA TGGCCTNKTC 420 

KYCTYCKYCS STAGCGAAAT CTTSGGTATC CTATANTKYC CCWAAANRCA AWTCTGGGCK 480

KTCCATKTSC TGGSKTCCRA ATTTAMMACA NCGGTTTCTT TCWTACCAAA AACCSNTGGG 540

CCCCRACCRA AAAAKGATAA TAATAAKGTG CWWWCAAAAC CCCGCCCCCC RRTTCAAYCG 600

GTCCARCACC CCANGNGGTN AGGTNGGAAT TYTMAACCCC CAGCCCATAA SNTTNSGNAA 660

AAACCCCCCN GGGYMYCAAA AMMCTTTTTG GGGMTTCSGS CCATKGYKCC AAAACCAAAA 720

TMTTTCYGGT CRWAAAAACC GGCCCNCCCG NAAATTTTTT GKCAACCCCA AACCTTTMAM 780

CCNNNTTCYY YCCCNSACAA TNGGSGGNKN NGSSCNTTYT TWTTTYYNNA GGGGGGRRWC 840

SNCCCCNAAN YYCCNAANKG NKCCCGSNMA AAAGAGANTT YCMKAAAAAC CCCCNCNCCC 900

NAAAYACCCC MAAAKWTTCM AAASMSCNNG YCCCCC 936

(2) INFORMATION FOR SEQ ID NO: 336:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1042 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:336: 

NNNGNKNNNY ATMMAYTCWY YCTSCACCSG GGNNWCWATT GCGGCCRMAW KCTTGTMAAS 60 

AGATCTMNAA YTCGGCACAG ASSSGCACAG ASCCGCGGCG CTATYCMYCC GYTGCTCATG 120

CTCAACACGC TCKTCGGCGW GRATAATGGC NCGCCGCCGG CGCCAACACG YTCAAYTGCT 180

TCGCCAACGC CATATNTCAA CAAGGTRATA AAASCAAAAC CGCSCGCCGY GCCCTTGGGC 240

SCGGRAASCG GTGCCAACCC RAAACNCKTT GGGCACYCGG KTSRACTTTA AASGGTAATC 300

TCKTCCTCCT GGGCTATGGT GCGCCACAAA CCTSYTGGCG WGGGTCTGGC CCTGGGYCAC 360 

CGYCRCNTTT TATNTNTCCK YCTACACNCT TKGGTYCAAC CAACCCACTT CACMAAATTG 420 

TTTTGGGKTG GGGSSGCCGG YTGTNNCCGK TAATAATCSG NTGKTCSGCC MYCACCGGWA 480 

CCATANCCTG GCCGGCSCTG GCAAATTTCC SAAATCATYT CCTTCTGRAC CCCCACAMRC 540 

CTNSAAATCC GRATCAATNC CCCNKGGCTT NTCYCTCTCN GTRCCCAATY TGGTTTCTAT 600 

RKTNCCCYAA TSCAATTGGS TTYCCRTTSC YGSTTCCAAN TTNACAAMAS GGTTTYTCMT 660 

ACCAAAACCC NTGGSCCNNA CMNAAAAKNA RAAAANAKGG KCTTTYAAAC CCCCCCCTAT 720

TCAWYCGGTN CMRNWCCCCG NGKAAGGKGN GAAAYTTHRA CCCAANCCMT ARSTTSGNAK 780 

AAACCCYYCG GGGTSMCAAA MKNTWTTSSC CTTCGGMCTT YCCAAATMSA AAATYYTCKK 840 

KRMNAAAAMC YGNCCCCSAA ANATTTTTGT NAAMCCCKMA YYTRTTWMCC WTTTTCCYCC 900 

CCMCNNSNSG GNTNCCCTTY TYATTTCYMM MCRNNSGACN CCCCMNTYTT TWTTCKCWCN 960

MMARGSNNYT RGRMMNMNCC CCNCCCCNAK MTCCNCAAAK NTTTNAACNN NNKYCKCCCC 1020

CCCMWMNKNC CCCCMNCMTT TM 1042 

(2) INFORMATION FOR SEQ ID NO: 337:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337:

NNSGSGMKKK ATAMATCWCT CTSYACCSNG GMTCWATTGC GGCCGMAWTC TNGTMAASAG 
ATCTCGAAYT CGGCAAANAK ACGCMAYGTC AAGTGTRAYY CGGTCACATA TCMTCGCGNG 
TCAAGMCGAA ASGGSNGTGA CGGYCTCCCT GGGSCGCCAC CCCCATCGGT RATGCAKCYT 
CGCGCGCCAC CGYCAAAAGG KTCWTTRAGG CGCTAAAGGT CAMCAATTCC TRAGGTYMCN 
CACCGTTNTT TGGCCCGCCC RAWTYCTRAC CCGCAATWTC GGTAATCGGR AATTTGGGCW 
YCGGCTTGGG CAATAAGKTN TTGGGCAACG GCGGRWTCYC NCTGGCCGRA ATTC CCNCAT 
TCCKTTAACG GKTGRACCGT TTYCCCGGYT GCCGTAAYTG YTYCNTGGGC GCCYTCGGCC 
CRNAGC AS YY CRCTAACGGY CMCCAGGCAA TACCKTTGGC TTTRAACCAC CGGRATNAAY 
TGKTACCCAC YTCAASSGTS CTGRANTTRK TNTCNTGRAA AANMCCACCN AACCCGGNTT 
RATCTGCTTC MTCANCWTTT SCCGGGTTCT GCCGTTTTGR AAYCTTNATC CMTYCAAAAG 
GTTTAMTTTC CCAANRAATT CGGYTTGCCA CCTTGGCCGS GGCTGGTTTM CGMWCCTTRR 
AMATCCNCCS GCGGGSAAAN AMTTSGGNTT SGSCCGGTCC CCCGNAATAT YCNTGGNCCT 
GNAAATTGSS GGGATCCCCN GSGNAYCCGG CCWTKGGGGK TNCCCAGTTG GWACAATTYC 
WKCCGTTCCA AACCCGGGNC CGGGGGGTGG GSCCCNTTTT CCTMYNNAAA AAGKGTTTGN 
NYYTTTTCCG CNRAANTTCA CCSKCNKTNT GGNCCNAACY YYYCAANTTC CANACCTTTA
AASAAANCYK YGKTYYCCCC TTTTMCCSGS SANCCCCCCM NMSSKNCGGG AAAAAAAGNK 
TYNGCCTTAN CNSNKTKTTT TNKTYCCCCC NMWNNSNMCY NCBKKCNKRY NGNSNMNCCT 
MKYSKCNNNN SNNNNNKCGN GSNCSGMKYM CMNNCNGMYK NGNKSNNCCC MSC

(2) INFORMATION FOR SEQ ID NO: 338:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338: 

GNSNGNKNTN TMCAYCWYCT SCACSGGGTC TATTGCGGCC GCAATYTNGT CKASAGATCT 
CGATYTCGGC AMNANAARTG TCGTCGTCAA TTTCAGKKTG GTCKTCAAAY GGGCCAGGCC 
GNGACCRACA CCCTGNGTCA CCCAAAANAC CAACAGCWTC AAATWTCAAG GCCRAGGCSC 
TRTCAATYCC CRASCAKTTA ACCGTKTCCW TCRAAGGTGC CRAACCAGGC ACCCAGYTCA
CCGCCSGGCA AWTCGCGCTG CCGGCCGGTN TCAGCCTGAT TYCTGACCCT RWTCTGTSGG 
TGGYCAMCNT GGTGAAGGCC CWWCCGCCNA AGAACTGGAG GGCRAATTCC CAGGANCCNA 
GRAACCCNAG GAACCCGCGG TAKAANCCGG CRAAACCRAG GCCGYTGGCN ATTCCNATTA
NAMSGGTTTG CRACNTGGCC RAACCGTTTY CTTGGTCGGC CTCGGCAACC CTGGACCANT
TACCCCKTNC CCGGNMCMAC CYCGGGTNCT TGKYCCCAAT NTGCYCCCGC GNRANTNGGC 
CNAATTCCAG GGCNCCANCT TTCCGGCCCN AATTCCCYTG GTTAATCACC GGGCNCNCCT
GGTTTTGGGC AACCCCNCYS CTTMTTTAAA CATTCCGSCC CAAATGGGNC STTGGSAAAT
TCTNTYCGGT GGGGCSGGCR ANMYTTCTCT YCCCNAASAN CTTAMYCCAN TTCGSSNTCC 720

CGGKCAAAWS NGGGGGGGNA AAGGGCCCCC CGGNTSCKCC GGGGKKGCCC CYGGKTTCAA 780 

AANTTTCSGG GKTSTMSCGG NVTCSCCCCC CSGCCAAGRA CCGNGGTTTT TTTTTGAACC 840 

KCMANTCSSA AMCCGCCSSC CCCMAAAGGS GCCTNAAWGR RAYTTNKSCC CNNAAACSGG 900 

CCCCCAKYTY SGGKTTCNNC CNCCSGKKGT CCMTSTTTMM MRCCCTTTGN GNKTTTTTAN 960

MGSCCTTNNC CACCCCCYCK GGGKCSMNNA GAAKTMYWKC CNGGGGNNAN RSCCCCCCNN 1020 

GSGKGGGGKG MGAGYSCCKT CTKGCGNCNN YKNTTTCCCC C 1061



(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 986 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339: 



GNNGNNNKWN ATMCAYCWYY CTSCACCSGG GMTCWATTGC GGCCGCAWKY TNGTMAASAG 60

ATCTMGAAYT CGGCACANAG CGGCACAGAG TGTGTGCATC TGTGTCANAG CTGTCAACGC 120

GGTGCCGCSG GTGGTRASCA CMCATTGCGR AACACCAAAC CCGTCCGCGG GYCACCGGCK 180

TCGCCTGCAA AAYCCTCCAG GCCACCYCRA AACAAYWYCT CCTGCAACSC ARSCCGTTYC 240

GCGGCCGRAT CCTGGKYCAS YTCGCCKTGC GGTGCGCCAA GGTACTGGCS CWYCRANACC 300

GCTYCGGGRA ACCNAACGTA AATCTTGCCN AATTTGCNTT CCCCCTSCCC TTRATNAATT 360

TGTTAAACCA CGCAAACCTY CGGGCKTCTC CTCKTGCCRA WTCCGRWTCC RATNYCGCCA 420

TGGCCTNKTC KYCTYCKYCS GTMCCCAAAT CTTGGTATCC TATATTGTCC CTAAATGCAA 480

ATCTKGGCTG TCCATNTGCT GGCGTTCAAA TTWAMANCAG NGGTTTCTTY CTTCCNAAAC 540

CCSTTGGCCC CAAACCNAAA AATGATNATA ATAATGGTGC TNTCAAACCC CGCNCCCATY 600

CNATCSGKCC AMMCCCCRGN GGKTANKKGG GNAATTCTMM AACCCCAAGC CATAASNTTG 660

SGANAAACCY NCNCMGGYCA CCAAAACANY NTTNTTGGNY SSNTTCGGMN YCATGGCTNN 720

CMAAAACCCA AATACTNYYG GGYCCAATAA AAMMMSGGYC SAMCCGGAAA WTTTTYTTGN 780

KYNAAACCNA AAKCCTTTTT CNAACCCDAN WNTYCCTNCC RCRCMANTGG CNSGGARTKT 840 

SSSCTTNCCA ATGKYCCMAA AGNGGGRANA CCARCCCCAA TTCCTNNNTN KNKNCCCNST 900 

TRNAAAAGGG GKNTYNCMAA AASCNCCNCC NCNCTCCCAA AAKAMCCCCN AAAGAKNTCN 960 

NAANASKYSN NNNSCCCCCC CCMMMN 986 



(2) INFORMATION FOR SEQ ID NO: 340: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1074 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 



NGNGGGNKRN ATMMAYCWCT SATYYACCSN GGMNMWATTG CGGCCRMAWT CTNGTMKASA 60

GATCTMGAAA YTCGGCAAAG AGYATKCTCG GGGGCCAGAT TTNTGGCCCG CAACCGCCGC 120 

ACTTTGCAYW TCAACAKTCC SGGTGCCCCA AAAAAWTCWT ACCCCCATMC TYCKTGCASM 180 

ASYTGCGCCC RATTRAACAC CCGGCCGGCW TGCTGCGCCA GGTATTYCAS CAGYTCAAAY 240

YCTTTKTAGK TAAAATCCAG CSGGCGGCCA CNCAGCCGGG CGGTKTAGGT GCCTYCRTCA 300 




ATMACCAGCY CGCCCAGGGY CACCTTGCCC AAAAYCTCCT GGGTCAGCCA AATTYCCGCS 360

CCGGCCAACM ACCANCCGCA TYCTGGCNTC AATCYCACCG GGCCCGGTGY TAAAMMANMA 420

GRATCTCKTC MANCCCCCAN TCAGCSYTNA CNGCMACAGC CCGCCTTCTT CAMACCGCCA 480

RTACCGGGWT CAACCGGCCS GTCAAACTCA ACAGGCGGNC AGGCCTCCCC CGGANSAAAG 540

GTCTTACSCC NNYAANAAAA MAAGNTCTGT TTTCCCCCTC CASAASNAAA AANCCCCSGC 600

CGGGCCTTCN NMMGGGTTTG GGGMANANAA AARCNCCGGN GGAACGNATC CGAAAMCTCC 660

CAAGTCNCMT TWAWAACYCN NNAACCCCCC ANTTTTGGGA AAGGNTCCCC NTTMYCCCCC 720 

TTTTASGKTS GGGMMYYCTY TAAAAAAATT CCCCAAAAAG CCCCGGGAAG GGTCMAMCTG 780 

GGNAAATTTC CAAMCCNWGK TTNTTYNGGT TMCGGGGGRA AATTYCNCTC CCYYNNNGGG 840

CSSGSNNNAT TAYGGMSNMT TTTNNAAWTM NSGKKTSAMM YNNKCCMNNN SNNMSMANNK 900 

TNAMCKCCCN CCTCNGNGKY CSCYNCCCSG GNAGNGGRAS MKCCNANMAA AYASGNTTNK 960 

CGGAAMMCNN AATKGNNNSC CCGGASMCMN NNNMAAATMT CNCNKCNSNN AANRGMRACN 1020

CCCNSNSGMN RRGAARMTNY YCCCCCGSKM GKGNKAAAAW GKYCCCCCCM AAAG 1074 

(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1195 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:341: 

NGNGNCNKNT MTACATCWTT CTGCACCSGG GNTCWANTGC GGCCGCAWKY TTGTCGASAG 60

ATCTCGAAYT CGGCAMGAGG ACWCTCGCRA CGCCCCCACA NACTCTGGCG TGTGTACCCC 120

ATTGNGCGCK TCACGCGCCC AYTGANCCAK TNCACTGGGG TGCCGTYCGC CKTGCGCGGC 180

GGCCTCACGG CKCTSCWTCT RAAGGCWTGG CGCACCGCAT TCGGTTTTCT RAACGCTGGG 240

AAAWTGGCCA GCCGTCTGGC TCATGGGNTC TACGCAACGC CNGCCCCCAA CRCTTTCTTA 300

AATCCGGYCC NTCCTGANCS CTTTGAAYCC CGGGGSAAGA ACTGGTTGCS CNCGAYCTGC 360

TCGAACTTRK TCNAAATCCC GCANAKTGTT TCNTAMGYCC CNCCGGAAGG NGAACCTACT 420

TTCNGGWANG TCGGCNKCCG GCGCTTATCA STCCTGATCA ACGGGGAACT GGYKNNSTTG 480

KGGGAAAAAG RRCCTCAATG MTYGGTCCKC GCTGCGKANC CGCSCCCTGK GYCGCNAATG 540

GAAGGCSMAG GGTTAANGCC MTTYCNYCCR RSCCGTSTGA SGKWTTYCGG MGGANKAMNN 600

NNKMAMWTTK TCRGNGGCCW ATSTSCCGGG CKSTTAKAGA ANACTYCCKW WCCGTNTYSC 660

SAAAGNTKCS GCGMGTTTTS SCCKMGANGN YCTGATTTSA GGGGGKYKCC CCCGGGGTYC 720

CGAAWKWRKY CCYAGGGGGM GNYCSAGCSC CGMNNATNAG AGNAAGGKTT RYGSTSKNCC 780

TYTNKGGACC WSCNNCWSAK ANAACNNKKT TGCSCCNTMS AGNKTNKGRT YCCNKTSTTC 840

TAAGAGGAGC TATKMKCGCC CKTGGANGMM GAGWGMGCGC KYCCCSNKRT TCNTNGWAAA 900

TATKSAGMGG TKCCGMAGMK CCSCGTTTKT TKTGANAAMN MSMRKNKKTG CGMGYTCTSC 960

GGGNTTTGTA GAGTAKTCGS CSCSSMWGAC WCSGMCMGNG AGKNKTNNTS YANTGARCGY 1020

MNNSKTMKMT MSCSCGCGNA GGAGNGCCCC CSANGMSTGY NKGGNMSSNG ARAKGATGGS 1080

GGCCNCGMNN MGMGGANMGA SANNGMGGMR GGGGGKTGKC TCKCSCCGNS CSANGRAGAA 1140

GKTCNGSCGC CGMGGKYGKT KTKTKNKTGG YSTCMSSMMM NAGAAAAGAG AGGGC 1195

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

CCATCTGATC GTTGGCAACC AGCATCGCAG TGGGAACGAT GCCCTCATTC AGCATTTGCA 60 

TGGTTTGTTG AAAACCGGAC ATGGCACTCC AGTCGCCTTC CCGTTCCGQT ATCGGCTGAA 120

TTTGATTGCG AGTGAGATAT TTATGCCAGC CAGCCAGACG CAGACGCGCC GAGACAGAAC 180

TTAATGGGCC CGCTAACAGC GCGATTTGCT GGTGACCCAA TGCGACCAGA TGCTCCACGC 240 

CCAGTCGCGT ACCGTCTTCA TGGGAGAAAA TAATACTGTT GATGGGTGTC TGGTCAGAGA 300 

CATCAAGAAA TAACGCCGGA ACATTAGTGC AGGCAGCTTC CACAGCAATG GCATCCTGGT 360

CATCCAGCGG ATAGTTAATG ATCAGCCCAC TGACGCGTTG CGCGAGAAGA TTGTGCACCG 420

CCGCTTTACA GGCTTCGACG CCGCTTCGTT CTACCATCGA CACCACCACG CTGGCACCCA 480

GTTGATCGGC GCGAGATTTA ATCGCCGCGA CAATTTGCGA CGGCGCGTGC AGGGCCAGAC 540 

TGGAGGTGGC AACGCCAATC AGCAACGACT GTTTGCCCGC CAGTTGTTGT GCCACGCGGT 600 

TGGGAATGTA ATTCAGCTCC GCCATCGCCG CTTCCACTTT TTCCCGCGTT TTCGCAGAAA 660 

CGTGGCTGGC CTSGTTCACC AGGGGGGAAA CGGTCTGATA AGAGACACCG GCATACTCTG 720

CGACATCGTA TAACGTTACT GGTTTCACAT TCACCACCCT GAATTGACTC TCTTCCGGGC 780 

GCTATCATGC CATACCGCGA AAGGTTTTGC GCCATTCGAT GGTGTCCGGG ATCTCGACGC 840

TCTCCCTTAT GCGACTCCTG CATTAGGAAG CAGCCCAGTA GTAGGTTGAG GCCGTTGAGC 900

ACCGCCGCCG CAAGGAATGG TGCATGCAAG GAGATGGCGC CCAACAGTCC CCCGGCCACG 960

GGGCCTGCCA CCATACCCAC GCCGAAACAA GCGCTCATGA GCCCGAAGTG GCGAGCCCGA 1020

TCTTCCCCAT CGGTGATGTC GGCGATATAG GCGCCAGCAA CCGCACCTGT GGCGCCGGTG 1080

ATGCCGGCCA CGATGCGTCC GGCGTAGAGG ATCGAGATCT CGATCCCGCG AAATTAATAC 1140

GACTCACTAT AGGGGAATTG TGAGCGGATA ACAATTCCCC TCTAGAAATA ATTTTGTTTA 1200

ACTTTAAGAA GGAGATATAC ATATGGGCCA TCATCATCAT CATCACGTGA TCGACATCAT 1260

CGGGACCAGC CCCACATCCT GGGAACAGGC GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA 1320

TAGCGTCGAT GACATCCGCG TCGCTCGGGT CATTGAGCAG GACATGGCCG TGGACAGCGC 1380

CGGCAAGATC ACCTACCGCA TCAAGCTCGA AGTGTCGTTC AAGATGAGGC CGGCGCAACC 1440

GAGGGGCTCG AAACCACCGA GCGGTTCGCC TGAAACGGGC GCCGGCGCCG GTACTGTCGC 1500

GACTACCCCC GCGTCGTCGC CGGTGACGTT GGCGGAGACC GGTAGCACGC TGCTCTACCC 1560

GCTGTTCAAC CTGTGGGGTC CGGCCTTTCA CGAGAGGTAT CCGAACGTCA CGATCACCGC 1620

TCAGGGCACC GGTTCTGGTG CCGGGATCGC GCAGGCCGCC GCCGGGACGG TCAACATTGG 1680

GGCCTCCGAC GCCTATCTGT CGGAAGGTGA TATGGCCGCG CACAAGGGGC TGATGAACAT 1740

CGCGCTAGCC ATCTCCGCTC AGCAGGTCAA CTACAACCTG CCCGGAGTGA GCGAGCACCT 1800

CAAGCTGAAC GGAAAAGTCC TGGCGGCCAT GTACCAGGGC ACCATCAAAA CCTGGGACGA 1860

CCCGCAGATC GCTGCGCTCA ACCCCGGCGT GAACCTGCCC GGCACCGCGG TAGTTCCGCT 1920 

GCACCGCTCC GACGGGTCCG GTGACACCTT CTTGTTCACC CAGTACCTGT CCAAGCAAGA 1980 

TCCCGAGGGC TGGGGCAAGT CGCCCGGCTT CGGCACCACC GTCGACTTCC CGGCGGTGCC 2040 

GGGTGCGCTG GGTGAGAACG GCAACGGCGG CATGGTGACC GGTTGCGCCG AGACACCGGG 2100

CTGCGTGGCC TATATCGGCA TCAGCTTCCT CGACCAGGCC AGTCAACGGG GACTCGGCGA 2160 

GGCCCAACTA GGCAATAGCT CTGGCAATTT CTTGTTGCCC GACGCGCAAA GCATTCAGGC 2220

CGCGGCGGCT GGCTTCGCAT CGAAAACCCC GGCGAACCAG GCGATTTCGA TGATCGACGG 2280

GCCCGCCCCG GACGGCTACC CGATCATCAA CTACGAGTAC GCCATCGTCA ACAACCGGCA 2340

AAAGGACGCC GCCACCGCGC AGACCTTGCA GGCATTTCTG CACTGGGCGA TCACCGACGG 2400

CAACAAGGCC TCGTTCCTCG ACCAGGTTCA TTTCCAGCCG CTGCCGCCCG CGGTGGTGAA 2460

GTTGTCTGAC GCGTTGATCG CGACGATTTC CAGCGCTGAG ATGAAGACCG ATGCCGCTAC 2520

CCTCGCGCAG GAGGCAGGTA ATTTCGAGCG GATCTCCGGC GACCTGAAAA CCCAGATCGA 2580

CCAGGTGGAG TCGACGGCAG GTTCGTTGCA GGGCCAGTGG CGCGGCGCGG CGGGGACGGC 2640 

CGCCCAGGCC GCGGTGGTGC GCTTCCAAGA AGCAGCCAAT AAGCAGAAGC AGGAACTCGA 2700 

CGAGATCTCG ACGAATATTC GTCAGGCCGG CGTCCAATAC TCGAGGGCCG ACGAGGAGCA 2760

GCAGCAGGCG CTGTCCTCGC AAATGGGCTT TGGATTCAGC TTCGCGCTGC CTGCTGGCTG 2820 

GGTGGAGTCT GACGCCGCCC ACTTCGACTA CGGTTCAGCA CTCCTCAGCA AAACCACCGG 2880 

GGACCCGCCA TTTCCCGGAC AGCCGCCGCC GGTGGCCAAT GACACCCGTA TCGTGCTCGG 2940

CCGGCTAGAC CAAAAGCTTT ACGCCAGCGC CGAAGCCACC GACTCCAAGG CCGCGGCCCG 3000

GTTGGGCTCG GACATGGGTG AGTTCTATAT GCCCTACCCG GGCACCCGGA TCAACCAGGA 3060

AACCGTCTCG CTYGACGCCA ACGGGGTGTC TGGAAGCGCG TCGTATTACG AAGTCAAGTT 3120

CAGCGATCCG AGTAAGCCGA ACGGCCAGAT CTGGACGGGC GTAATCGGCT CGCCCGCGGC 3180

GAACGCACCG GACGCCGGGC CCCCTCAGCG CTGGTTTGTG GTATGGCTCG GGACCGCCAA 3240 

CAACCCGGTG GACAAGGGCG CGGCCAAGGC GCTGGCCGAA TCGATCCGGC CTTTGGTCGC 3300 

CCCGCCGCCG GCGCCGGCCG GGGAAGTCGC TCCTACCCCG ACGACACCGA CACCGCAGCG 3360

GACCTTACCG GCCTGAGAAT TCTGCAGATA TCCATCACAC TGGCGGCCGC TCGAGCACCA 3420 

CCACCACCAC CACTGAGATC CGGCTGCTAA CAAAGCCCGA AAGGAAGCTG AGTTGGCTGC 3480 

TGCCACCGCT GAGCAATAAC TAGCATAACC CCTTGGGGCC TCTAAACGGG TCTTGAGGGG 3540

TTTTTTGCTG AAAGGAGGAA CTATATCCGG AT 3572 

(2) INFORMATION FOR SEQ ID NO: 343: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

Val Gln Phe Gln Ser Gly Gly Asp Asn Ser Pro Ala Val Tyr Xaa Xaa
1 5 10 15

Asp Gly Xaa Arg
20



(2) INFORMATION FOR SEQ ID NO: 344:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:344: 

Thr Thr Val Pro Xaa Val Thr Glu Ala Arg
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 345: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 



Thr Thr Pro Ser Xaa Val Ala Phe Ala Arg
1 5 10

(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:346: 

Asp Ala Gly Lys Xaa Ala Gly Xaa Asp Val Xaa Arg 

1 5 10

(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:347: 

Thr Xaa Glu Glu Xaa Gln Glu Ser Phe Asn Ser Ala Ala Pro Gly Asn

1 5 10 15 

Xaa Lys 



(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:348: 
CTAGTTAGTA CTCAGTCGCA GACCGTG 

(2) INFORMATION FOR SEQ ID NO: 349:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

GCAGTGACGA ATTCACTTCG ACTCC 25

(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

CATATGGGCC ATCATCATCA TCATCACGTG ATCGACATCA TCGGGACCAG CCCCACATCC 60

TGGGAACAGG CGGCGGCGGA GGCGGTCCAG CGGGCGCGGG ATAGCGTCGA TGACATCCGC 120

GTCGCTCGGG TCATTGAGCA GGACATGGCC GTGGACAGCG CCGGCAAGAT CACCTACCGC 180

ATCAAGCTCG AAGTGTCGTT CAAGATGAGG CCGGCGCAAC CGAGGGGCTC GAAACCACCG 240

AGCGGTTCGC CTGAAACGGG CGCCGGCGCC GGTACTGTCG CGACTACCCC CGCGTCGTCG 300

CCGGTGACGT TGGCGGAGAC CGGTAGCACG CTGCTCTACC CGCTGTTCAA CCTGTGGGGT 360

CCGGCCTTTC ACGAGAGGTA TCCGAACGTC ACGATCACCG CTCAGGGCAC CGGTTCTGGT 420

GCCGGGATCG CGCAGGCCGC CGCCGGGACG GTCAACATTG GGGCCTCCGA CGCCTATCTG 480

TCGGAAGGTG ATATGGCCGC GCACAAGGGG CTGATGAACA TCGCGCTAGC CATCTCCGCT 540

CAGCAGGTCA ACTACAACCT GCCCGGAGTG AGCGAGCACC TCAAGCTGAA CGGAAAAGTC 600

CTGGCGGCCA TGTACCAGGG CACCATCAAA ACCTGGGACG ACCCGCAGAT CGCTGCGCTC 660

AACCCCGGCG TGAACCTGCC CGGCACCGCG GTAGTTCCGC TGCACCGCTC CGACGGGTCC 720

GGTGACACCT TCTTGTTCAC CCAGTACCTG TCCAAGCAAG ATCCCGAGGG CTGGGGCAAG 780 

TCGCCCGGCT TCGGCACCAC CGTCGACTTC CCGGCGGTGC CGGGTGCGCT GGGTGAGAAC 840

GGCAACGGCG GCATGGTGAC CGGTTGCGCC GAGACACCGG GCTGCGTGGC CTATATCGGC 900 

ATCAGCTTCC TCGACCAGGC CAGTCAACGG GGACTCGGCG AGGCCCAACT AGGCAATAGC 960 

TCTGGCAATT TCTTGTTGCC CGACGCGCAA AGCATTCAGG CCGCGGCGGC TGGCTTCGCA 1020

TCGAAAACCC CGGCGAACCA GGCGATTTCG ATGATCGACG GGCCCGCCCC GGACGGCTAC 1080 

CCGATCATCA ACTACGAGTA CGCCATCGTC AACAACCGGC AAAAGGACGC CGCCACCGCG 1140

CAGACCTTGC AGGCATTTCT GCACTGGGCG ATCACCGACG GCAACAAGGC CTCGTTCCTC 1200 

GACCAGGTTC ATTTCCAGCC GCTGCCGCCC GCGGTGGTGA AGTTGTCTGA CGCGTTGATC 1260 

GCGACGATTT CCAGCGCTGA GATGAAGACC GATGCCGCTA CCCTCGCGCA GGAGGCAGGT 1320

AATTTCGAGC GGATCTCCGG CGACCTGAAA ACCCAGATCG ACCAGGTGGA GTCGACGGCA 1380

GGTTCGTTGC AGGGCCAGTG GCGCGGCGCG GCGGGGACGG CCGCCCAGGC CGCGGTGGTG 1440

CGCTTCCAAG AAGCAGCCAA TAAGCAGAAG CAGGAACTCG ACGAGATCTC GACGAATATT 1500 

CGTCAGGCCG GCGTCCAATA CTCGAGGGCC GACGAGGAGC AGCAGCAGGC GCTGTCCTCG 1560

CAAATGGGCT TTGTGCCCAC AACGGCCGCC TCGCCGCCGT CGACCGCTGC AGCGCCACCC 1620 

GCACCGGCGA CACCTGTTGC CCCCCCACCA CCGGCCGCCG CCAACACGCC GAATGCCCAG 1680

CCGGGCGATC CCAACGCAGC ACCTCCGCCG GCCGACCCGA ACGCACCGCC GCCACCTGTC 1740

ATTGCCCCAA ACGCACCCCA ACCTGTCCGG ATCGACAACC CGGTTGGAGG ATTCAGCTTC 1800

GCGCTGCCTG CTGGCTGGGT GGAGTCTGAC GCCGCCCACT TCGACTACGG TTCAGCACTC 1860 

CTCAGCAAAA CCACCGGGGA CCCGCCATTT CCCGGACAGC CGCCGCCGGT GGCCAATGAC 1920 

ACCCGTATCG TGCTCGGCCG GCTAGACCAA AAGCTTTACG CCAGCGCCGA AGCCACCGAC 1980 

TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC ATGGGTGAGT TCTATATGCC CTACCCGGGC 2040

ACCCGGATCA ACCAGGAAAC CGTCTCGCTC GACGCCAACG GGGTGTCTGG AAGCGCGTCG 2100 

TATTACGAAG TCAAGTTCAG CGATCCGAGT AAGCCGAACG GCCAGATCTG GACGGGCGTA 2160

ATCGGCTCGC CCGCGGCGAA CGCACCGGAC GCCGGGCCCC CTCAGCGCTG GTTTGTGGTA 2220

TGGCTCGGGA CCGCCAACAA CCCGGTGGAC AAGGGCGCGG CCAAGGCGCT GGCCGAATCG 2280 

ATCCGGCCTT TGGTCGCCCC GCCGCCGGCG CCGGCACCGG CTCCTGCAGA GCCCGCTCCG 2340 

GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT ACCCCGACGA CACCGACACC GCAGCGGACC 2400

TTACCGGCCT GA 2412 

(2) INFORMATION FOR SEQ ID NO: 351: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 



Met Gly His His His His His His Val Ile Asp Ile Ile Gly Thr Ser
1 5 10 15

Pro Thr Ser Trp Glu Gln Ala Ala Ala Glu Ala Val Gln Arg Ala Arg
20 25 30

Asp Ser Val Asp Asp Ile Arg Val Ala Arg Val Ile Glu Gln Asp Met
35 40 45

Ala Val Asp Ser Ala Gly Lys Ile Thr Tyr Arg Ile Lys Leu Glu Val
50 55 60

Ser Phe Lys Met Arg Pro Ala Gln Pro Arg Gly Ser Lys Pro Pro Ser
65 70 75 80

Gly Ser Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Pro
85 90 95

Ala Ser Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr
100 105 110

Pro Leu Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn
115 120 125

Val Thr Ile Thr Ala Gln Gly Thr Gly Ser Gly Ala Gly Ile Ala Gln
130 135 140

Ala Ala Ala Gly Thr Val Asn Ile Gly Ala Ser Asp Ala Tyr Leu Ser
145 150 155 160

Glu Gly Asp Met Ala Ala His Lys Gly Leu Met Asn Ile Ala Leu Ala
165 170 175

Ile Ser Ala Gln Gln Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His
180 185 190

Leu Lys Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gln Gly Thr Ile
195 200 205

Lys Thr Trp Asp Asp Pro Gln Ile Ala Ala Leu Asn Pro Gly Val Asn
210 215 220

Leu Pro Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly
225 230 235 240

Asp Thr Phe Leu Phe Thr Gln Tyr Leu Ser Lys Gln Asp Pro Glu Gly
245 250 255

Trp Gly Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val
260 265 270

Pro Gly Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys
275 280 285

Ala Glu Thr Pro Gly Cys Val Ala Tyr Ile Gly Ile Ser Phe Leu Asp
290 295 300

Gln Ala Ser Gln Arg Gly Leu Gly Glu Ala Gln Leu Gly Asn Ser Ser
305 310 315 320

Gly Asn Phe Leu Leu Pro Asp Ala Gln Ser Ile Gln Ala Ala Ala Ala
325 330 335

Gly Phe Ala Ser Lys Thr Pro Ala Asn Gln Ala Ile Ser Met Ile Asp
340 345 350

Gly Pro Ala Pro Asp Gly Tyr Pro Ile Ile Asn Tyr Glu Tyr Ala Ile
355 360 365

Val Asn Asn Arg Gln Lys Asp Ala Ala Thr Ala Gln Thr Leu Gln Ala
370 375 380

Phe Leu His Trp Ala Ile Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp
385 390 395 400

Gln Val His Phe Gln Pro Leu Pro Pro Ala Val Val Lys Leu Ser Asp
405 410 415

Ala Leu Ile Ala Thr Ile Ser Ser Ala Glu Met Lys Thr Asp Ala Ala
420 425 430

Thr Leu Ala Gln Glu Ala Gly Asn Phe Glu Arg Ile Ser Gly Asp Leu
435 440 445

Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly Ser Leu Gln Gly
450 455 460

Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Gln Ala Ala Val Val Arg
465 470 475 480

Phe Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu Asp Glu Ile Ser
485 490 495

Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu Glu
500 505 510

Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe Val Pro Thr Thr Ala
515 520 525

Ala Ser Pro Pro Ser Thr Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro
530 535 540

Val Ala Pro Pro Pro Pro Ala Ala Ala Asn Thr Pro Asn Ala Gln Pro
545 550 555 560

Gly Asp Pro Asn Ala Ala Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro
565 570 575

Pro Pro Val Ile Ala Pro Asn Ala Pro Gln Pro Val Arg Ile Asp Asn
580 585 590

Pro Val Gly Gly Phe Ser Phe Ala Leu Pro Ala Gly Trp Val Glu Ser
595 600 605

Asp Ala Ala His Phe Asp Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr
610 615 620

Gly Asp Pro Pro Phe Pro Gly Gln Pro Pro Pro Val Ala Asn Asp Thr
625 630 635 640

Arg Ile Val Leu Gly Arg Leu Asp Gln Lys Leu Tyr Ala Ser Ala Glu
645 650 655

Ala Thr Asp Ser Lys Ala Ala Ala Arg Leu Gly Ser Asp Met Gly Glu
660 665 670

Phe Tyr Met Pro Tyr Pro Gly Thr Arg Ile Asn Gln Glu Thr Val Ser
675 680 685

Leu Asp Ala Asn Gly Val Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys
690 695 700

Phe Ser Asp Pro Ser Lys Pro Asn Gly Gln Ile Trp Thr Gly Val Ile
705 710 715 720

Gly Ser Pro Ala Ala Asn Ala Pro Asp Ala Gly Pro Pro Gln Arg Trp
725 730 735

Phe Val Val Trp Leu Gly Thr Ala Asn Asn Pro Val Asp Lys Gly Ala
740 745 750

Ala Lys Ala Leu Ala Glu Ser Ile Arg Pro Leu Val Ala Pro Pro Pro
755 760 765

Ala Pro Ala Pro Ala Pro Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala
770 775 780

Gly Glu Val Ala Pro Thr Pro Thr Thr Pro Thr Pro Gln Arg Thr Leu
785 790 795 800

Pro Ala



(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:352: 
GGATCCAAAC CACCGAGCGG TTCGCCTGAA ACGG 

(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:353: 

CGCTGCGAAT TCACCTCCGG AGGAAATCGT CGCGATC 

(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1962 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 



CATATGGGCC ATCATCATCA TCATCACGGA TCCAAACCAC CGAGCGGTTC GCCTGAAACG 60

GGCGCCGGCG CCGGTACTGT CGCGACTACC CCCGCGTCGT CGCCGGTGAC GTTGGCGGAG 120

ACCGGTAGCA CGCTGCTCTA CCCGCTGTTC AACCTGTGGG GTCCGGCCTT TCACGAGAGG 180

TATCCGAACG TCACGATCAC CGCTCAGGGC ACCGGTTCTG GTGCCGGGAT CGCGCAGGCC 240

GCCGCCGGGA CGGTCAACAT TGGGGCCTCC GACGCCTATC TGTCGGAAGG TGATATGGCC 300

GCGCACAAGG GGCTGATGAA CATCGCGCTA GCCATCTCCG CTCAGCAGGT CAACTACAAC 360

CTGCCCGGAG TGAGCGAGCA CCTCAAGCTG AACGGAAAAG TCCTGGCGGC CATGTACCAG 420 

GGCACCATCA AAACCTGGGA CGACCCGCAG ATCGCTGCGC TCAACCCCGG CGTGAACCTG 480

CCCGGCACCG CGGTAGTTCC GCTGCACCGC TCCGACGGGT CCGGTGACAC CTTCTTGTTC 540

ACCCAGTACC TGTCCAAGCA AGATCCCGAG GGCTGGGGCA AGTCGCCGGG CTTCGGCACC 600 

ACCGTCGACT TCCCGGCGGT GCCGGGTGCG CTGGGTGAGA ACGGCAACGG CGGCATGGTG 660

ACCGGTTGCG CCGAGACACC GGGCTGCGTG GCCTATATCG GCATCAGCTT CCTCGACCAG 720

GCCAGTCAAC GGGGACTCGG CGAGGCCCAA CTAGGCAATA GCTCTGGCAA TTTCTTGTTG 780

CCCGACGCGC AAAGCATTCA GGCCGCGGCG GCTGGCTTCG CATCGAAAAC CCCGGCGAAC 840 

CAGGCGATTT CGATGATCGA CGGGCCCGCC CCGGACGGCT ACCCGATCAT CAACTACGAG 900 

TACGCCATCG TCAACAACCG GCAAAAGGAC GCCGCCACCG CGCAGACCTT GCAGGCATTT 960 

CTGCACTGGG CGATCACCGA CGGCAACAAG GCCTCGTTCC TCGACCAGGT TCATTTCCAG 1020 

CCGCTGCCGC CCGCGGTGGT GAAGTTGTCT GACGCGTTGA TCGCGACGAT TTCCTCCGGA 1080 

GGTGGCAGTG GGGGAGGCTC AGGTGGAGGT TCTGGCGGGA GCGTGCCCAC AACGGCCGCC 1140

TCGCCGCCGT CGACCGCTGC AGCGCCACCC GCACCGGCGA CACCTGTTGC CCCCCCACCA 1200

CCGGCCGCCG CCAACACGCC GAATGCCCAG CCGGGCGATC CCAACGCAGC ACCTCCGCCG 1260

GCCGACCCGA ACGCACCGCC GCCACCTGTC ATTGCCCCAA ACGCACCCCA ACCTGTCCGG 1320

ATCGACAACC CGGTTGGAGG ATTCAGCTTC GCGCTGCCTG CTGGCTGGGT GGAGTCTGAC 1380

GCCGCCCACT TCGACTACGG TTCAGCACTC CTCAGCAAAA CCACCGGGGA CCCGCCATTT 1440

CCCGGACAGC CGCCGCCGGT GGCCAATGAC ACCCGTATCG TGCTCGGCCG GCTAGACCAA 1500

AAGCTTTACG CCAGCGCCGA AGCCACCGAC TCCAAGGCCG CGGCCCGGTT GGGCTCGGAC 1560

ATGGGTGAGT TCTATATGCC CTACCCGGGC ACCCGGATCA ACCAGGAAAC CGTCTCGCTC 1620

GACGCCAACG GGGTGTCTGG AAGCGCGTCG TATTACGAAG TCAAGTTCAG CGATCCGAGT 1680

AAGCCGAACG GCCAGATCTG GACGGGCGTA ATCGGCTCGC CCGCGGCGAA CGCACCGGAC 1740 

GCCGGGCCCC CTCAGCGCTG GTTTGTGGTA TGGCTCGGGA CCGCCAACAA CCCGGTGGAC 1800

AAGGGCGCGG CCAAGGCGCT GGCCGAATCG ATCCGGCCTT TGGTCGCCCC GCCGCCGGCG 1860

CCGGCACCGG CTCCTGCAGA GCCCGCTCCG GCGCCGGCGC CGGCCGGGGA AGTCGCTCCT 1920 

ACCCCGACGA CACCGACACC GCAGCGGACC TTACCGGCCT GA 1962 

(2) INFORMATION FOR SEQ ID NO: 355:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 652 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355:
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Ser 
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Phe 


Asn 


Leu 


Trp 


Gly 


Pro 


Ala 


Phe 


His 


Glu 


Arg 
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Pro 
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Val 


Thr 
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Gly 
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Gly 
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Gly 


Ala 


Gly 


Ile


Ala 


Gln


Ala 


Ala 
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Gly 
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Val 
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Gly 


Ala 


Ser 
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Ala 


Tyr 


Leu 


Ser 


Glu 
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100 

Ala Gln Gln Val
115 

Leu Asn Gly Lys 
130 

Trp Asp Asp Pro 
145 

Gly Thr Ala Val 

Phe Leu Phe Thr 
180 

Lys Ser Pro Gly 
195 

Ala Leu Gly Glu 
210 

Thr Pro Gly Cys 
225 

Ser Gln Arg Gly

Phe Leu Leu Pro 
260 

Ala Ser Lys Thr 
275 

Ala Pro Asp Gly 
290 

Asn Arg Gln Lys
305 

His Trp Ala Ile

His Phe Gln Pro
340

Ile Ala Thr Ile
355 

Gly Ser Gly Gly 
370 

Ala Ala Ala Pro 
385 

Ala Ala Ala Asn 

Pro Pro Pro Ala 
420 

Asn Ala Pro Gln
435 

Phe Ala Leu Pro 
450

Tyr Gly Ser Ala 
465 

Gly Gln Pro Pro

Leu Asp Gln Lys
500 

Ala Ala Arg Leu 
515 

Gly Thr Arg Ile
530 



Asn Tyr Asn Leu 
120 

Val Leu Ala Ala 
135 

Gln Ile Ala Ala
150 

Val Pro Leu His 
165 

Gln Tyr Leu Ser

Phe Gly Thr Thr 
200 

Asn Gly Asn Gly 
215 

Val Ala Tyr Ile
230 

Leu Gly Glu Ala 
245 

Asp Ala Gln Ser

Pro Ala Asn Gln
280

Tyr Pro Ile Ile
295 

Asp Ala Ala Thr 
310 

Thr Asp Gly Asn 
325 

Leu Pro Pro Ala 

Ser Ser Gly Gly 
360 

Ser Val Pro Thr 
375 

Pro Ala Pro Ala 
390 

Thr Pro Asn Ala 
405 

Asp Pro Asn Ala 

Pro Val Arg Ile
440 

Ala Gly Trp Val 
455 

Leu Leu Ser Lys 
470 

Pro Val Ala Asn 
485 

Leu Tyr Ala Ser 

Gly Ser Asp Met 
520 

Asn Gln Glu Thr
535 
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Asp 
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Asp 
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Val 


Asp 


Phe 


Pro 


Gly Met Val 


Thr 
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Ser 
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Gly 


Gly 
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Val 
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Gly 
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Pro 


Pro 
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Asp 


Asn 


Pro 


Val 


Glu 


Ser 


Asp 


Ala 
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Gly 
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Asp 


Thr 


Arg 


Ile
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Ala 


Glu 


Ala 


Thr 
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Gly Glu 


Phe 


Tyr 


Val 


Ser 


Leu 


Asp 
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110 

Glu His Leu Lys 
125 

Thr Ile Lys Thr

Val Asn Leu Pro
160 

Ser Gly Asp Thr 
175 

Glu Gly Trp Gly 
190 

Ala Val Pro Gly 
205 

Gly Cys Ala Glu 

Leu Asp Gln Ala
240 

Ser Ser Gly Asn 
255 

Ala Ala Gly Phe 
270 

Ile Asp Gly Pro
285

Ala Ile Val Asn

Gln Ala Phe Leu
320

Leu Asp Gln Val
335 

Ser Asp Ala Leu 
350 

Gly Ser Gly Gly 
365 

Pro Pro Ser Thr 

Pro Pro Pro Pro 
400 

Pro Asn Ala Ala 
415 

Val Ile Ala Pro
430 

Gly Gly Phe Ser 
445 

Ala His Phe Asp 

Pro Pro Phe Pro 
480 

Val Leu Gly Arg 
495 

Asp Ser Lys Ala 
510 

Met Pro Tyr Pro 
525 

Ala Asn Gly Val 
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Val Lys 


Phe Ser Asp 


Pro 
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Trp Thr Gly Val He 


Gly Ser Pro Ala Ala Asn 
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Phe Val Val 


Trp 


Leu 
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Val 


Asp 
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Gly Ala 


Ala Lys Ala 


Leu 
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Pro 
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Gly Glu Val 
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Pro 
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